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(54) Title: SECRETED POLYPEPTIDE SPFXTIRS ASSOCIATED WITH CARDIOVASCULAR DISORDERS 

(57) Abstract: The invention discloses human secreted polypeptides that circulate at an increased level in the plasma of patients with 
cardiovascular disorders. The invention also provides methods of using compositions including the polypeptides, polynucleotides 
encoding them, and antibodies specific for these polypeptides, for diagnosis, prognosis, and for drug development. 
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SECRETED POLYPEPTIDE SPECIES ASSOCIATED WITH CARDIOVASCULAR 

DISORDERS 

FIELD OF THE INVENTION 

The invention relates to polypeptide species secreted preferentially in individuals with 
cardiovascular disorders, to isolated polynucleotides encoding such polypeptides, to polymorphic 
variants thereof and to the use of said nucleic acids and polypeptides or compositions thereof in 
detection assays, for cardiovascular disorder diagnosis, and for drug development 

BACKGROUND 

Cardiovascular disease is a major health risk throughout the industrialized world. Coronary 
Artery Disease (CAD) is characterized by atherosclerosis or hardening of the arteries. Atherosclerosis 
is the most prevalent of cardiovascular diseases, is the principal cause of heart attack, stroke, and 
gangrene of the extremities, and thereby the principle cause of death in the United States. 
Atherosclerosis is a complex disease involving many cell types and molecular factors (described in, 
for example, Ross, 1993, Nature 362: 801-809). In normal circumstances a protective response to 
insults to the endothelium and smooth muscle cells (SMCs) of the wall of the artery consists of the 

I ne advanced lesions ©i atherosclerosis may occlude the artery concerned, and result from an 
excessive inflammatory-fibroproliferative response to numerous different forms of insult Injury or 
dysfunction of the vascular endothelium is a common feature of many conditions that predispose an 
individual to accelerated development of atherosclerotic cardiovascular disease. 

Atherosclerotic plaques occlude the blood vessel concerned and restrict the flow of blood, 
resulting in ischemia. Ischemia is a condition characterized by a lack of oxygen supply in tissues of 
organs due to inadequate perfusion. Such inadequate perfusion can have a number of natural causes, 
including atherosclerotic or restenotic lesions, anemia, or stroke. The most common cause of 
ischemia in the heart is atherosclerotic disease of epicardial coronary arteries. By reducing the lumen 
of these vessels, atherosclerosis causes an absolute decrease in myocardial perfusion in the basal state 
or limits appropriate increases in perfusion when the demand for flow is augmented. Coronary blood 
flow can also be limited by arterial thrombi, spasm, and, rarely, coronary emboli, as well as by ostial 
narrowing due to luetic aortitis. Congenital abnormalities, such as anomalous origin of the left 
anterior descending coronary artery from the pulmonary artery, may cause myocardial ischemia and 
infarction in infancy, but this cause is very rare in adults. 
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Myocardial ischemia can also occur if myocardial oxygen demands are abnormally increased, 
as in severe ventricular hypertrophy due to hypertension or aortic stenosis. The latter can be present 
with angina that is indistinguishable from that caused by coronary atherosclerosis. A reduction in the 
oxygen-carrying capacity of the blood, as in extremely severe anemia or in die presence of carboxy- 
5 hemoglobin, is a rare cause of myocardial ischemia. Not infrequently, two or more causes of ischemia 
will coexist, such as an increase in oxygen demand due to left ventricular hypertrophy and a reduction 
in oxygen supply secondary to coronary atherosclerosis. 

Extensive clinical studies have identified factors that increase fee risk of cardiovascular 
disorders. Some of these risk factors, such as age, gender, and family history cannot be changed 
10 Other risk factors include die following: smoking, high blood pressure, high fat and high cholesterol 
diet, diabetes, lack of exercise, obesity, and stress. 

Fortunately, many contributing factors are controllable through lifestyle changes. The risk of 
cardiovascular disorders for smokers is more than twice that of non-smokers. When a person stops 
smoking, regardless of how much he or she may have smoked in the past, their risk of developing a 
15 disorder rapidly declines. Serum cholesterol level is directly related to prevalence of cardiovascular 
disorder and hypertension or high blood pressure is an important risk factor. Physical activity has 
been postulated to reduce the risk of developing a cardiovascular disorder through various 

myocardial contraction and its electrical impulse staoHrty. Keoueea oxygen aemaaa ana myocardial 
20 work are reflected in lowered heart rate and blood pressure at rest. Physical activity also increases the 
diameter and dilatory capacity of coronary arteries, increases collateral artery formation, and reduces 
rates of progression of coronary artery atherosclerosis. Obesity and the serum fatty acids are reduced 
by activity. 

There may be no noticeable symptoms of a cardiovascular disorder at rest, but symptoms such 
25 as chest pressure may occur with increased activity or stress. Other first signs that can appear are 

heartburn, nausea, vomiting, numbness, shortness of breath, heavy cold sweating, unexplained fatigue, 
and feelings of anxiety. The more severe symptoms of cardiovascular disorders are chest pain (angina 
pectoris), rhythm disturbances (arrirythmias), stroke, or heart attack (myocardial infarction). Strokes 
and heart attacks result from a blocked artery in the brain and heart tissue, respectively. Because 
30 symptoms vary, the tests and treatments chosen can be very different from one patient to another. 

Diagnostic tests useful in determining the extent and severity of cardiovascular disorder 
include: electrocardiogram {EKG), stress test, nuclear scanning, coronary angiography, resting EKG, 
EKG Multiphase Information Diagnosis indexes, Holter monitor, late potentials, EKG mapping, 
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echocardiogram, Thallium scan, PET, MRI, CT, angiogram and IVUS. Additional risk factor 
measures and useful diagnostics are common and best applied by one of skill in the art of medicine. 
There are many different therapeutic approaches, depending on the seriousness of the disease. For 
many people, cardiovascular disorders are managed with lifestyle changes and medications. More 
severe diagnoses may indicate a need for surgery. 

Surgical approaches to the treatment of ischemic atherosclerosis include bypass grafting, 
coronary angioplasty, laser angioplasty, atherectomy, endarterectomy, and percutaneous translumenal 
angioplasty (PCTA). The failure rate after these approaches due to restenosis, in which the occlusions 
recur and often become even worse, is extraordinarily high (30-50%). It appears that much of the 
restenosis due to further inflammation, smooth muscle accumulation, and thrombosis. Additional 
therapeutic approaches to cardiovascular disease have included treatments that encouraged 
angiogenesis in such conditions as ischemic heart and limb disease. 

The non-specific nature of most CAD and cardiovascular disorder symptoms makes definitive 
diagnosis difficult. More quantitative diagnostic methods suffer from variability, both between 
individuals and between readings on a single individual. Thus, diagnostic measures must be 
standardized and applied to individuals with well-documented and extensive medical histories. 
Further, current diagnostic methods often do not reveal the underlying cause for a given observation 

ttuuittoct uic wauuxiti if i* yiuoitiu iaiiu amy tvcii ocJ LULttttiiLt tv uiu iiiujviuuai. 

Methods of diagnosis that rely on nucleotide detection include genetic approaches and 
. expression profiling. For example, genes that are known to be involved in cardiovascular disorders 
may be screened for mutations using common genotyping techniques such as sequencing, 
hybridization-based techniques, or PCR. In another example, expression from a known gene may be 
tracked by standard techniques including RTPCR, various hybridization-based techniques, and 
sequencing. These strategies often do not enable a practitioner to detect differences in rnRNA 
processing and splicing, translation rate, mRNA stability, and posttranslational modifications such as 
proteolytic processing, phosphorylation, glycosylation, and amidation. 

To address the current weaknesses in the diagnostic state of the art for cardiovascular 
disorders, the invention provides specific plasma polypeptides that are differentially increased in 
plasma from individuals with Coronary Artery Disease compared to control plasma. By providing the 
actual polypeptide species, differences in mRNA processing and splicing, translation rate, mRNA 
stability, and posttranslational modifications such as proteolytic processing, phosphorylation, 
glycosylation, and amidation are revealed. The polypeptides of the invention are thus described as 
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"Cardiovascular disorder Plasma Polypeptides" or CPPs, These polypeptide sequences are described 
as SEQ ID NOs: 1-2, 5-6, 13, and 20-21, and those comprising at least one of the amino acid 
sequences selected from SEQ ID NOs: 3-4, 7-12, 14-19 and 22-25 (see Figures 1-4). These 
polypeptides are more precisely referred to as "Cardiovascular disorder Plasma Polypeptides 12, 15, 
16, and 18" (CPP 12 corresponding to SEQ ID NOs: 1-4, CPP 15 corresponding to SEQ ID NOs:5-12, 
CPP 16 corresponding to SEQ ID NOs:13-19, and CPP 18 corresponding to SEQ ID NOs:20-25), and 
include fragments, and post-translationally modified species of CPPs that are present at a higher level 
m plasma obtained from individuals with Coronary Artery Disease (CAD). Preferred fragments of the 
invention are those described as SEQ ID NOs: 3-4, 7-12, 14-19 and 22-25. 

The present invention discloses "Cardiovascular disorder Plasma Polypeptides" (CPPs), 
fragments, and post-translationally modified species of CPPs that are present at a higher level in 
plasma obtained from individuals with Coronary Artery Disease (CAD). Thus, the CPPs of the 
invention represent an important diagnostic tool for detennining the risk of CAD, coronary heart 
disease (CHD), peripheral vascular disease, cerebral ischemia (stroke), congestive heart failure, 
atherosclerosis, hypertension, and other cardiovascular diseases. CPPs are secreted factors and, as 
such, are readily detectable and useful for drug development, diagnosis, and prevention of 
cardiovascular disease. 

The present invention is directed to compositions related to secreted polypeptide species that 
are preferentially increased in plasma from individuals with a cardiovascular disorder. These 
polypeptide species are designated herein "Cardiovascular disorder Plasma Polypeptides," or CPPs. 
Such Cardiovascular disorder Plasma Polypeptides comprise an amino acid sequence selected from 
one of the groups consisting of SEQ ID NOs: 1-4, 5-12, 13-19 and 20-25, and are designated CPP 12, 
CPP 15, CPP 16, and CPP 18, respectively. Compositions include CPP precursors, antibodies specific 
for CPPs, including monoclonal antibodies and other binding compositions derived therefrom. 
Further included are methods of making and using these compositions. Precursors of the invention 
include unmodified precursors, proteolytic precursors of SEQ ID NOs: 1-4, 5-12, 13-19 and 20-25, 
and intermediates resulting from alternative proteolytic sites in the ammo acid sequences of SEQ ID 
NOs: 1-4, 5-12, 13-19 and 20-25. 

A preferred embodiment of the invention includes CPPs having a posttranslational 
modification, such as a phosphorylation, glycosylation, acetylarion, amidation, or a C-, N- or O- 
Hnked carbohydrate group. Additionally preferred are CPPs with intra- or inter-molecular 
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interactions, e.g., disulfide and hydrogen bonds that result in higher order structures. Also preferred 
are CPPs that result from differential mRNA processing or splicing. Preferably, the CPPs represent 
posttranslationally modified species, structural variants, or splice variants that are present in plasma 
from individuals with a cardiovascular disorder. 
5 hi another aspect, the invention includes CPPs comprising a sequence which is at least 75 

percent identical to a sequence selected from the group consisting of SEQ ID NOs: 1-25. Preferably, 
the invention includes polypeptides comprising at least 80 percent, and more preferably at least 85 
percent, and still more preferably at least 90 percent, identity with any one of the sequences selected 
from SEQ ID NOs: 1-25. Most preferably, the invention includes polypeptides comprising a sequence 

10 at least 95 percent identical to a sequence selected from the group consisting of SEQ ID NOs: 1 -25 . 

M another aspect, the invention includes natural variants of CPPs having a frequency in a 
selected population of at least two percent. More preferably, such natural variant has a frequency in a 
selected population of at least five percent, and still more preferably, at least ten percent Most 
preferably, such natural variant has a frequency in a selected population of at least twenty percent 

1 5 The selected population may be any recognized population of study in the field of population 

genetics. Preferably, the selected population is Caucasian, Negroid, or Asian. More preferably, the 
selected population is French, German, English, Spanish, Swiss, Japanese, Chinese, Irish, Korean, 
Singaporean, Icelandic. North American T<?r*f*1i ArnV> TWVich Ot- P ~v T+^i^r. p*>HqK p^*fv 

20 population is Icelandic, Saarni, Finnish, French of Caucasian ancestry, Swiss, Singaporean of Chinese 
ancestry, Korean, Japanese, Quebecian, North American Pima Indians, Pennsylvanian Amish and 
Amish Mennonite, Newfoundlander, or Polynesian. 

A preferred aspect of the invention provides a composition comprising an isolated CPP, i.e., a 
CPP free from proteins or protein isoforms having a significantly different isoelectric point or a 

25 significantly different apparent molecular weight from the CPP. The isoelectric point and molecular 
weight of a CPP may be indicated by affinity and size-based separation chromatography, 2- 
dimensional gel analysis, and mass spectrometry. 

In a preferred aspect, the invention provides particular polypeptide species that comprise an 
amino acid sequence selected from one of the groups consisting of SEQ ID NOs: 3-4, 7-12, 14-19 and 

30 22-25. Preferably, the particular polypeptide species further comprises contiguous amino acid 

sequence from SEQ ID NOs: 1-2, 5-6, 13, and 20-21, respectively. Preferred species are polypeptides 
that i) comprise an amino acid sequence selected from one of the groups consisting of SEQ ID NOs: 
3-4, 7-12, 14-19 and 22-25; it) appear at a higher level in plasma from individuals with a 
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cardiovascular disorder; and iii) optionally result from proteolytic processing of the polypeptide of 
SEQ ID NOs: 1-2, 5-6, 13, and 20-21, respectively. 

In a still further aspect, the invention provides a combination of two or more of the 
polypeptides selected from the one of the groups consisting consisting of SEQ ID NOs: 1-4, 5-12, 13- 
5 19, and 20-25. 

hi an additional aspect, the invention includes modified CPPs. Such modifications include 
protecting/blocking groups, linkage to an antibody molecule or other cellular ligand, and detectable 
labels, such as an enzymatic, fluorescent, isotopic or affinity label to allow for detection and isolation 
of the protein. Chemical modifications may be carried out by known techniques, including but not 

10 limited, to specific chemical cleavage by cyanogen bromide, trypsin, chymotrypsin, papain, V8 
protease, NaBH4, acetylation, formylation, oxidation, reduction, or metabolic synthesis in the 
presence of timicamycin. . 

Also provided by the invention are chemically modified derivatives of the polypeptides of the 
invention which may provide additional advantages such as increased solubility, stability and 

1 5 circulating time of tine polypeptide, or decreased immunogenicity (e.g., water soluble polymers such 
as polyethylene glycol, ethylene glycol/propylene glycol copolymers, carboxyrnethylcelhilose, 
dextran, polyvinyl alcohol). The CPPs are modified at random positions within the molecule, or at 

ViXCaiiCUl SUV1CLLC6. 

20 In another embodiment, the invention provides a method of identifying a modulator of at least 

one CPP biological activity comprising the steps of: i) contacting a test modulator of a GPP biological 
activity with the polypeptide comprising the amino acid sequence selected from one of the groups 
consisting of the amino acid sequences listed in Table 1 (corresponding to CPP 12, CPP 15, CPP 16 
and CPP 18); ii) detecting the level of said CPP biological activity; and iii) comparing the level of 

25 said CPP biological activity to that of a control sample lacking said test modulator. Where the 
difference in the level of CPP protein biological activity is a decrease, the test modulator is an 
inhibitor of at least one CPP biological activity. Where the difference in the level of CPP biological 
activity is an increase, the test substance is an activator of at least one CPP biological activity. 
In another aspect of the invention, a method of identifying a modulator of a 

30 cardiovascular disorder is provided, which comprises the steps of: (a) administering a candidate agent 
to a non-human test animal which is predisposed to be affected or which is affected by the 
cardiovascular disorder, (b) adrninistering the candidate agent of (a) to a matched control non-human 
animal not predisposed to be affected or not being affected by the cardiovascular disorder; (c) 
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detecting and /or quantifying the level of at least one polypeptide in a biological sample obtained from 
the non-human test animal of step (a) and from the control animal of step (b), wherein the at least one 
polypeptide is selected from: (i) a polypeptide comprising the amino acid sequence selected from one 
of the groups consisting of SEQ IDNOs: 1-2, 5-6, 13, and 20-21; (ii) a variant, with at least 75% 
sequence identity, having one or more amino acid substitutions, deletions or insertions relative to the 
amino acid sequence shown in SEQ 3D NOs: 1, 2, 5, 6, 13, 20, and 21; and (iii) a. fragment of a 
polypeptide as defined in 0 or ii) above which is a least ten amino acids long; and step (d) comparing 
the levels of the polypeptide of step (c); wherein a displacement of the level of the polypeptide in the 
biological sample obtained from the non-human test animal towards the level of the polypeptide in the 
biological sample obtained from the control animal indicates mat the candidate agent is a modulator 
of the cardiovascular disorder. In a further embodiment of the invention the polypeptide level(s) is 
detected / quantified in combination with the level(s) of one or more of the following polypeptides: 
CPP 2, CPP8, CPP 9, CPP 13, CPP 14, CPP 17, CPP 19, CPP 20, CPP 40, CPP 41, CPP 149, CPP 
150, CPP 151, CPP 501, CPP 502, CPP 503, CPP 504, CPP 505, CPP 506, CPP 507, CPP 508, CPP 
509. A preferred embodiment of the invention provides that the non-human test animal which is 
predisposed to be affected or which is affected by the cardiovascular disorder comprises an increased 
plasma level of at least one polypeptide selected from (i) a polypeptide comprising the amino acid 

V&naiH, W1IH aL iVit&l jJ /Q SOC jU GULxZ iUCfliity s it* vujg One ta. tuviG axioms trwu oui/jaiUuoiu, u^iutio-ito ux 

insertions relative to the amino acid sequence shown in SEQ ID NOs: 1, 2, 5, 6, 13, 20, and 21; and 
(iii) a fragment of a polypeptide as defined in i) or ii) above which is a least ten amino acids long. 



Another embodiment of the invention relates to a non-human test animal which further comprises an 
alteration in the plasma level of one or more of the following polypeptides: CPP 2, CPP 8, CPP 9, 
CPP 13, CPP 14, CPP 17, CPP 19, CPP 20, CPP 40, CPP 41, CPP 149, CPP 150, CPP 151, CPP 501, 
CPP 502, CPP 503, CPP 504, CPP 505, CPP 506, CPP 507, CPP 508, CPP 509. 

hi another aspect, the invention includes polynucleotides encoding a CPP of the invention, 











1 



groups consisting of SEQ ID NGs: 1-4, 5-12, 13-19 and 20-25, oligonucleotides complementary to 
CPP gene sequences for diagnostic and analytical assays (e.g., PCR, hybridization-based techniques), 
and vectors for expressing CPPs. 

In another aspect, the invention provides a vector comprising DNA encoding a CPP. The 
invention also includes host cells and transgenic non-human animals comprising such a vector. There 
is also provided a method of making a CPP or CPP precursor. One preferred method comprises the 



7 



WO 2004/106941 



PCT/EP2004/005786 



steps of (a) providing a host cell containing an expression vector as disclosed above; (b) culturing the 
host cell under conditions whereby the DNA segment is expressed; and (c) recovering the protein 
encoded by the DNA segment. Another preferred method comprises the steps of: (a) providing a host 
cell capable of expressing a CPP; (b) culturing said host cell under conditions that allow expression of 
said CPP; and (c) recovering said CPP. Within one embodiment the expression vector further 
comprises a secretory signal sequence operably linked to the DNA segment, the cell secretes the 
protein into a culture medium, and the protein is recovered from the medium. An especially preferred 
method of making a CPP includes chemical synthesis using standard peptide synthesis techniques, as 
described in the section titled "Chemical Manufacture of CPP Compositions" and in Example 2. 

In another aspect, the invention includes isolated antibodies specific for any of the 
polypeptides, peptide fragments, or peptides described above. Preferably, the antibodies of the 
invention, are monoclonal antibodies. Further preferred are antibodies that bind to a CPP exclusively, 
that is, antibodies that do not recognize other polypeptides with high affinity. Anti-CPP antibodies 
have purification, diagnostic and prognostic applications. Preferred anti-CPP antibodies for 
purification and diagnosis are attached to a label group. Preferred CPP-related disorders for diagnosis 
include coronary artery disease (CAD), coronary heart disease (CHD), peripheral vascular disease, 
cerebral ischemia (stroke), congestive heart failure, atherosclerosis, hypertension, and other 

annoooies or anaoody-oen vea oomposttioifcr spttcmv xor a err ann^ea. uia&uu&uv meuious lor 
detecting CPPs in specific tissue samples and biological fluids (preferably plasma), and for detecting 
levels of expression of CPPs in tissues, also form part of the invention. Compositions comprising 
one or more antibodies described above, together with a pharmaceutically acceptable carrier are also 
within the scope of the invention, for example, for in vivo diagnosis and drug screening assays. 

The invention further provides methods for diagnosis of cardiovascular disorders that 
comprise detecting in a sample of body fluid, preferably blood plasma, the presence or level of at least 
one CPP disclosed herein or any combination thereof. Further included are methods of using CPP 
compositions, including primers complementary to CPP genes and/or messenger RNA and anti- CPP 
antibodies, for detecting and measuring quantities of CPPs in tissues and biological fluids, preferably 
plasma. These methods are also suitable for clinical screening, prognosis, monitoring the results of 
therapy, identifying patients most likely to respond to a particular therapeutic treatment, drug 
screening and development, and identifying new targets for drug treatment 

A still further aspect of the invention relates to a method for monitoring the efficacy of a 
treatment of a subject having or at risk of developing a cardiovascular disorder with an agent, which 
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comprises the steps of: (a) obtaining a pre-administration biological sample from the subject prior to 
administration of the agent; (b) detecting and /or quantifying the level of at least one polypeptide in 
the biological sample from said subject, wherein the at least one polypeptide is selected from (i) a 
polypeptide comprising the amino acid sequence selected from one of the groups consisting of SEQ 
5 ID NOs: 1-2, 5-6, 13, and 20-21; (ii) a variant, with at least 75% sequence identity, having one or 
more amino acid substitutions, deletions or insertions relative to the amino acid sequence shown in 
SEQ ID NOs: 1, 2, 5, 6, 13, 20, and 21; and (iii) a fragment of a polypeptide as defined in i) or ii) 
above which is a least ten amino acids long; and which comprises steps (c) obtaining one or more 
post-administration biological samples from the subject; (d) detecting the level of the at least one 

10 polypeptide in the post-administration sample or samples; (e) comparing the level of the at least one 
polypeptide in the pre-administration sample with the level of the at least one polypeptide in the post- 
administration sample; and (f) adjusting the administration of the agent accordingly. In a further 
embodiment of the invention the polypeptide level(s) is detected / quantified in combination with the 
level(s) of one or more of the following polypeptides: CPP 2, CPP 8, CPP 9, CPP 13, CPP 14, CPP 

15 17, CPP 19, CPP 20, CPP 40, CPP 41, CPP 149, CPP 150, CPP 151, CPP 501, CPP 502, CPP 503, 
CPP 504, CPP 505, CPP 506, CPP 507, CPP 508, CPP 509. 

The invention provides kits mat may be used in the above-recited methods and that may 

20 assays for the identification of new diagnostic and/or therapeutic agents. 

The invention further includes methods of using CPP-modulating compositions to prevent or 
treat disorders associated with aberrant expression or processing of CPPs of SEQ ID NOs: 1-4, 5-12, 
13-19 and 20-25 in an individual. Preferred CPP-related disorders include coronary artery disease 
(CAD), coronary heart disease (CHD), peripheral vascular disease, cerebral ischemia (stroke), 

25 congestive heart failure, atherosclerosis, hypertension, and other cardiovascular diseases. A preferred 
embodiment of the invention is a method of preventing or treating a CPP-related disorder in an 
individual comprising the steps of: determining that an individual suffers from or is at risk of a CPP- 
related disorder and introducing a CPP-modulating composition to said individual. 

hi one embodiment, Coronary Artery Disease (CAD) is defined by the appearance of at least 

30 one symptom. Such symptoms become more serious as the disease progresses. CAD is often 

accompanied by reduced left ventricle capacity or output Early CAD symptoms include elevated 
plasma levels of cholesterol and low-density lipoprotein (especially oxidized forms), as well as 
platelet-rich plasma aggregations. The vascular endothelium responds to inflammation and thus 
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15 



formation of plaques and levels of inflammatory and fibrinogenic factors increase. In addition, CAD, 



or atherosclerosis, is characterized by vascular calcification and hardening of the arteries. The 
resulting partial occlusion of the blood vessels leads to hypertension and ischemic heart disease. 
Eventual complete vascular occlusion results in myocardial infarction, stroke, or gangrene. 

In a preferred embodiment, detection of increased plasma levels of at least one CPP of the 
invention indicates an increased risk that an individual will develop CAD. Preferably, said detection 
indicates that an individual has at least a 1 .05-fold, 1. 1 -fold, 1 .15-fold, and more preferably at least a 
1.2-fold increased likelihood of developing CAD. Alternatively, detection of increased plasma levels 
of at least one CPP of the invention indicates that an individual has CAD. The amount of CPP 
increase observed in an individual compared to a control sample will correlate with the certainty of 
the prediction or diagnosis of CAD. As individual plasma CPP levels will vary depending on family 
history and other risk factors, each will preferably be examined on a case-by-case basis, hi preferred 
embodiments, CPP is detected in a human plasma sample by the methods of the invention. Especially 
preferred techniques are mass spectrometry and immunodetection. Preferably, a prediction or 
diagnosis of CAD is based on at least a Li-, 1.15-, L2-, 1. 25-, and more preferably a 1.5 -fold increase 
in the experimental CPP level as compared to the control. 
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20 BRIEF DESCRIPTION OF THE SEQUENCE LISTING 

SEQ ID NO:l describes the amino acid sequence of Defensin 5 precursor, whereas SEQ ID 
NO:2 is die polypeptide sequence of the mature peptide (CPP 12). 

SEQ ID NOs:3 and 4 are the amino acid sequences of tryptic peptides found by tandem mass 
spectrometry at a higher level in plasma samples of individuals with coronary artery disease. 
25 SEQ ID NO: 5 describes the amino acid sequence of the CD59 precursor, whereas SEQ ID 

NO:6 represents die mature polypeptide (CPP 15). 

SEQ ID NOs:7-12 are the amino acid sequences of tryptic peptides found by tandem mass 
spectrometry at a higher level in plasma samples of individuals with coronary artery disease. 
SEQ ID NO: 13 describes the amino acid sequence of Flavin Reductase (CPP 16). 
30 SEQ ID NOs:14-17 and 19 are the amino acid sequences of tryptic peptides found by MS-MS 

mass spectrometry at an elevated level in plasma samples of individuals with coronary artery disease. 
SEQ ID NO: 18 is the amino acid sequence of the tryptic peptide found in control plasma. 

SEQ ID NO:20 describes the amino acid sequence of Microfibril Associated Glycoprotein 
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(MAGP)-2 precursor, whereas SEQ ID NO:21 is the polypeptide sequence of the mature protein 
(CPP18). 

SEQ ID NOs: 22-25 are the amino acid sequences of tryptic peptides found by tandem mass 
spectrometry at an elevated level in plasma samples of individuals with Coronary Artery Disease. 



Figure 1 shows the sequence of CPP 12 full length (SEQ ID NOs:l and 2) and peptide 
sequences found by MS-MS mass spectrometry in the plasma of individuals with coronary artery 
disease (SEQ ID NOs: 3 and 4). The tryptic peptides observed by tandem mass spectrometry are 
underlined in SEQ ID NOsrl and 2. The signal peptide is double-underlined in SEQ ID NO:l. 

Figure 2 shows the sequence of CPP 15 (SEQ ID NO:6), the sequence of the protein before 
maturation (SEQ ID NO:5) and the peptide sequences found by tandem mass spectrometry in the 
plasma of individuals with coronary artery disease (SEQ ID NOs: 7- 12). The tryptic peptides observed 
by tandem mass spectrometry are underlined in SEQ ID NOs:5 and 6. In SEQ ID NO:5, the signal 
peptide is double-underlined and the C terminal propeptide is italicized and double-underlined 

FigureS shows the sequence of CPP 16 full length (SEQ ID NO: 13) and peptide sequences 
found by MS-MS mass spectrometry in the plasma of individuals with coronary artery disease (SEQ 

UUUCi xiiii?U ill &UK^ kU l\\J.l^}. 

Figure 4 shows the sequence of CPP 1 8 (SEQ ID NO:21), of its precursor (SEQ ID NO:20) 
and of the peptide sequences found by tandem mass spectrometry in the plasma of individuals with 
Coronary Artery Disease (SEQ ID NOs: 22 -25). The tryptic peptides observed by tandem mass 
spectrometry are underlined in SEQ ID NOs:20 and 21 . In SEQ ID NO:20, the signal peptide is 
double-underlined. 



The present invention described in detail below provides methods, compositions, and kits 
useful for screening, diagnosis and prognosis of a cardiovascular disorder in a mammalian individual; 
for identifying individuals most likely to respond to a particular therapeutic treatment; for monitoring 
the results of cardiovascular disorder therapy, for screening CPP modulators, and for drug 



a non-human mammal, but is preferably human, more preferably a human adult. For clarity of 
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disclosure, and not by way of limitation, the invention will be described with respect to the analysis of 
blood plasma samples. However, as one skilled in the art will appreciate, the assays and techniques 
described below can be applied to other biological fluid samples (e.g. cerebrospinal fluid, lymph, bile, 
serum, saliva or urine) or tissue samples from an individual at risk of having or developing a 
cardiovascular disorder. The methods and compositions of the present invention are useful for 
screening, diagnosis and prognosis of a living individual, but may also be used for postmortem 
diagnosis in an individual, for example, to identify family members who are at risk of developing the 
same disorder. 

Definitions 

As used herein, the term Nucleic acids" and "nucleic acid molecule" is intended to include 
DNA molecules (e.g., cDNA or genomic DNA) and RNA molecules (e.g., mRNA) and analogs of the 
DNA or RNA generated using nucleotide analogs. The nucleic acid molecule can be single-stranded 
or double-stranded, but preferably is double-stranded DNA Throughout the present specification, the 
expression "nucleotide sequence" may be employed to designate indifferently a polynucleotide or a 
nucleic acid. More precisely, the expression 'Nucleotide sequence" encompasses the nucleic material 
itself and is thus not restricted to the sequence information (i.e. the succession of letters chosen 

used interchangeably herein are terms "nucleic acids", "oligonucleotides", and "polynucleotides ". 

An "isolated" nucleic acid molecule is one which is separated from other nucleic acid 
molecules which are present in the natural source of the nucleic acid. Preferably, an "isolated" nucleic 
acid is free of sequences which naturally flank the nucleic acid (i.e., sequences located at die 5' and 3* 
ends of die nucleic acid) in the genomic DNA of the organism from which the nucleic acid is derived. 
For example, in various embodiments, the isolated CPP nucleic acid molecule can contain less than 
about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0.1 kb of nucleotide sequences which naturally flank the 
nucleic acid molecule in genomic DNA of the cell from which the nucleic acid is derived. Moreover, 
an "isolated" nucleic acid molecule, such as a cDNA molecule, can be substantially free of other 
cellular material, or culture medium when produced by recombinant techniques, or substantially free 
of chemical precursors or other chemicals when chemically synthesized. Using all or a portion of the 
nucleic acid as a hybridization probe, CPP nucleic acid molecules can be isolated using standard 
hybridization and cloning techniques (e.g., as described in Sambrook, J., Fritsh, E. F., and Maniatis, 
T. Molecular Cloning. A Laboratory Manual. 2nd, ed., Cold Spring Harbor Laboratory, Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989). 
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As used herein, the term "hybridizes to" is intended to describe conditions for moderate 
stringency or high stringency hybridization, preferably where the hybridization and washing 

r 

conditions permit nucleotide sequences at least 60% homologous to each other to remain hybridized 
to each other. Preferably, the conditions are such that sequences at least about 70%, more preferably 
at least about 80%, even more preferably at least about 85%, 90%, 95% or 98% homologous to each 
other typically remain hybridized to each other. Stringent conditions are known to those skilled in the 
art and can be found in Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 
6.3.1-6.3.6. In a preferred, non-limiting example, stringent hybridization conditions for nucleic acid 
interactions are as follows: the hybridization step is realized at 65°C in the presence of 6 x SSC 
buffer, 5 x Denhardt's solution, 0,5% SDS and lOOug/ml of salmon sperm DNA. The hybridization 
step is followed by four washing steps: 

- two washings during 5 min, preferably at 65°C in a 2 x SSC and 0. 1%SDS buffer; 
-one washing during 30 min, preferably at 65°C in a 2 x SSC and 0. 1% SDS buffer, 

- one washing during 10 min, preferably at 65°C in a 0.1 x SSC and 0.1%SDS buffer, 
these hybridization conditions being suitable for a nucleic acid molecule of about 20 nucleotides in 
length. It will be appreciated that die hybridization conditions described above are to be adapted 
according to the length of the desired nucleic acid, following techniques well known to the one skilled 

Oxford; and Current Protocols in Molecular Biology. 

*Tercent homology" is used herein to refer to both nucleic acid sequences and amino acid 
sequences. Amino acid or nucleic acid "identity" is equivalent to amino acid or nucleic acid 
''homology*. To determine the percent homology of two amino acid sequences or of two nucleic 



nucleic acid sequence and non-homologous sequences can be disregarded for comparison purposes). 
The length of a reference sequence aligned for comparison purposes is at least 30%, preferably at 
least 40%, more preferably at least 50%, even more preferably at least 60%, and even more preferably 
at least 70%, 80%, 90% or 95% of the length of the reference sequence. The amino acid residues or 
nucleotides at corresponding amino acid positions or nucleotide positions are then compared. When a 
position in the first sequence is occupied by the same amino acid residue or nucleotide as the 
corresponding position in the second sequence, then the molecules are homologous at that position. 
The percent homology between the two sequences is a function of the number of identical positions 
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shared by the sequences (i.e., % homology=# of identical positions/total # of positions 100). 

Hie comparison of sequences and determination of percent homology between two sequences 
can be accomplished using a mathematical algorithm. A preferred, non-limiting example of a 
mathematical algorithm utilized for the comparison of sequences is the algorithm of Karlin and 
Altschul (1990) Proc. Natl. Acad. Sci. USA 87:2264-68, modified as in Karlin and Altschul (1993) 
Proc. Natl. Acad. Sci. USA 90:5873-77, the disclosures of which are incorporated herein by reference 
in their entireties. Such an algorithm is incorporated into the NBLAST and XBLAST programs 
(version 2.0) of Altschul, et al. (1990) J. Mol. Biol. 215:403-10. BLAST nucleotide searches can be 
performed with the NBLAST program, score=100, wordlength=12 to obtain nucleotide sequences 
homologous to the sequences of the invention. BLAST protein searches can be performed wife the 
XBLAST program, score=50, wordlength=3 to obtain amino acid sequences homologous to the 
polypeptide sequences of the invention. To obtain gapped alignments for comparison purposes, 
Gapped BLAST can be utilized as described in Altschul et al., (1997) Nucleic Acids Research 
25(17):3389-3402. When utilizing BLAST and Gapped BLAST programs, the default parameters of 
the respective programs (e.g., XBLAST and NBLAST) can be used. See 

http://www.ncbi jilrruiih.gov, the disclosures of which are incorporated herein by reference in their 
entireties. Another preferred, non-limiting example of a mathematical algorithm utilized for the 



which are incorporated herein by reference in their entireties. Sucrt an algorithm is incorporated into 
the ALIGN program (version 2.0) which is part of the GCG sequence alignment software package. 
When utilizing the ALIGN program for comparing amino acid sequences, a PAM120 weight residue 
table, a gap length penalty of 12, and a gap penalty of 4 can be used. 

The term '"polypeptide" refers to a polymer of amino acids without regard to the length of the 
polymer; thus, peptides, oligopeptides, and proteins are included within the definition of polypeptide. 
This term also does not specify or exclude post-translatiortal modifications of polypeptides, for 
example, polypeptides which include the covalent attachment of glycosyl, acetyl, phosphate, amide, 
lipid, carboxyl, acyl, or carbohydrate groups are expressly encompassed by the term polypeptide. 
Also included within the definition are polypeptides which contain one or more analogs of an amino 
acid (including, for example, non -naturally occurring amino acids, amino acids which only occur 
naturally in an unrelated biological system, modified amino acids from mammalian systems etc.), 
polypeptides with substituted linkages, as well as other modifications known in the art, both naturally 
occurring and non-naturally occurring. 

The term "protein" as used herein may be used synonymously with the term "polypeptide" or 
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nay refer to, in addition, a complex of two or more polypeptides which nay be linked by bonds other 
than peptide bonds, for example, such polypeptides making up the protein may be linked by disulfide 
bonds. The term "protein" may also comprehend a family of polypeptides having identical amino 
acid sequences but different post-translaticmal modifications, particularly as may be added when such 
5 proteins are expressed in eukaryotic hosts. 

An "isolated" or "purified" protein or biologically active portion thereof is substantially free 
of cellular material or other contaminating proteins from the cell or tissue source from which the 
protein of the invention (i.e., CPP or biologically active fragment thereof) is derived, or substantially 
free from chemical precursors or other chemicals when chemically synthesized. The language 

10 "substantially free of cellular material" includes preparations of a protein according to the invention in 
which the protein is separated from cellular components of the cells from which it is isolated or 
recombinant^ produced. In one embodiment, the language "substantially free of cellular material" 
includes preparations of a protein according to the invention having less than about 30% (by dry 
weight) of protein other than the protein of the invention (also referred to herein as a "contaminating 

15 protein"), more preferably less than about 20% of protein other than the protein according to the 

invention, still more preferably less than about 10% of protein other than the protein according to the 
invention, and most preferably less than about 5% of protein other than the protein according to the 

recommnanuy proouceo, u is aiso prereraoiy buoaamimtty irec m culture meaium, i.e., cuicure 
20 medium represents less than about 20%, more preferably less than about 10%, and most preferably 
less than about 5% of the volume of the protein preparation. 

The language "substantially free of chemical precursors or other chemicals" includes 
preparations of a protein of the invention in which the protein is separated from chemical precursors 
or other chemicals which are involved in the synthesis of the protein. In one embodiment, the 
25 language "substantially free of chemical precursors or other chemicals" includes preparations of a 
protein of the invention having less than about 30% (by dry weight) of chemical precursors or non- 
protein chemicals, more preferably less than about 20% chemical precursors or non-protein 
chemicals, still mere preferably less than about 10% chemical precursors or non-protein chemicals, 
and most preferably less than about 5% chemical precursors or non-protein chemicals. 
30 The term "recombinant polypeptide" is used herein to refer to polypeptides that have been 

artificially designed and which comprise at least two polypeptide sequences that are not found as 
contiguous polypeptide sequences in their initial natural environment, or to refer to polypeptides 
which have been expressed from a recombinant polynucleotide. 

15 
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The term "Cardiovascular disorder Plasma Polypeptide" or "CPP" refers to a polypeptide 
comprising the sequence described by any one of SEQ ID NOs: 1-4, 5-12, 13-19 and 20-25. Such 
polypeptide may be post-translationally modified as described herein. CPPs may also contain other 
structural or chemical modifications such as disulfide linkages or amino acid side chain interactions 
5 such as hydrogen and amide bonds that result in complex secondary or tertiary structures. CPPs also 
include mutant polypeptides, such as deletion, addition, swap, or truncation mutants, fusion 
polypeptides comprising such polypeptides, and polypeptide fragments of at least three, but 
preferably, and where applicable, 8, 10, 12, 15, or 21 contiguous amino acids of the sequence of SEQ 
ID NOs: 1-4, 5-12, 13-19 and 20-25. Further included are CPP proteolytic precursors and 

10 intermediates of the sequence selected from the group consisting of SEQ ID NOs: 1-4, 5-12, 13-19 
and 20-25. The invention embodies polypeptides encoded by the nucleic acid sequences of CPP 
genes or CPP mRNA species, preferably human CPP genes and mRNA species, including isolated 
CPPs consisting of, consisting essentially of, or comprising the sequence of SEQ ID NOs: 1-4, 5-12, 
13-19 and 20-25. Preferred CPPs have a sequence comprising one of the sequences of SEQ CD NOs: 

15 1-2, 5-6, 13, and 20-21. Preferred CPP fragments have a sequence comprising one of die sequences of 
SEQ ID NOs: 3-4, 7-12, 14-19 and 22-25. Preferred CPPs retain at least one biological activity of 
CPPs of SEQ ID NOs: 1-4, 5-12, 13-19 and 20-25. 

Ctr. loess inciuoe outare not lumteu *o; \±/ intiicaang mat an mamauai ms or wui mvc a 
20 cardiovascular disorder, (2) circulating through the bloodstream of individuals with a cardiovascular 
disorder; (3) antigenicity, or the ability to bind an anti-CPP specific antibody; (4) immunogenicity, or 
the ability to generate an anti-CPP specific antibody; and for CPP 12: (5) forming intermolecular 
amino acid side chain interactions such as hydrogen, amide, or preferably disulfide links; (6) 
interaction with a CPP target molecule, preferably a bacterial endotoxin; (7) neutralizing bacterial 
25 endotoxins; (8) promoting mast cell chemo taxis; (9) undergoing posttranslational processing, for 

example, specific proteolysis; and (10) functioning as an antimicrobial defense in the gut and female 
reproductive tract; for CPP 15: (5) forming intermolecular amino acid side chain interactions such as 
hydrogen, amide, or preferably disulfide links; (6) interaction with a CPP target molecule, preferably 
complement protein C9; {7) inhibiting complement lysis; and (8) undergoing posttranslational 
30 processing, for example, specific proteolysis and glycosylate on; for CPP 16: (5) interaction with a 
CPP target molecule, preferably NAD/ NADP; and for CPP 1 8: (5) having a posttranslational 
modification, such as glycosyiation; and (6) interaction with a CPP target molecule, preferably an 
integrin, microfibril or fibrillin. 
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As used herein, a "CPP modulator" is a molecule (e.g., polynucleotide, polypeptide, small 
molecule, or antibody) that is capable of modulating (Le., increasing or decreasing) either the 
expression or the biological activity of the CPPs of the invention. A CPP modulator that enhances 
CPP expression or activity is described as a CPP activator or agonist. Conversely, a CPP modulator 
that represses CPP expression or activity is described as a CPP inhibitor or antagonist Preferably, 
CPP modulators increase/ decrease the expression or activity by at least 5, 10, or 20%. CPP inhibitors 
include anti-CPP antibodies, fragments thereof, antisense polynucleotides, and molecules 
characterized by screening assays, as described herein. CPP agonists include polynucleotide 
expression vectors and molecules characterized by screening assays as described herein. 

A "CPP-related disorder" or "CPP-related disease" describes a cardiovascular disorder. 
Preferred disorders include coronary artery disease (CAD), coronary heart disease (CHD), peripheral 
vascular disease, cerebral ischemia (stroke), congestive heart failure, atherosclerosis, hypertension, 
and other cardiovascular diseases. Preferably, the likelihood that an individual will develop or 
already has such a disorder is indicated by higher than normal plasma levels of at least one CPP. 

Another aspect of the invention pertains to anti-CPP antibodies. The term "antibody" as used 
herein refers to immunoglobulin molecules and immunologically active portions of immunoglobulin 
molecules, i.e., molecules that contain an antigen-binding site which specifically binds (immunoreacts 



mi 
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aauoomcs ouia tu * ^rr exclusively ana *o not recognize omer polypeptides with high affinity. 
Examples of immunologically active portions of immunoglobulin molecules include F(ab) and F(ab f ) 2 
fragments winch can be generated by treating the antibody with an enzyme such as pepsin. The 
invention provides polyclonal and monoclonal antibodies that bind a CPP, or a biologically active 
fragment or homologue thereof The term "monoclonal antibody" or "monoclonal antibody 
composition", as used herein, refers to a population of antibody molecules that contain only one 
species of an antigen-binding site capable of irnmunoreacting with a particular epitope of a CPP. A 
monoclonal antibody composition thus typically displays a single binding affinity for a particular CPP 
with which it immunoreacts. Preferred CPP antibodies are attached to a label group. 

As used herein, a "label group" is any compound that, when attached to a polynucleotide or 
polypeptide (including antibodies), allows for detection or purification of said polynucleotide or 
polypeptide. Label groups may be detected or purified directly or indirectly by a secondary 
compound, including an antibody specific for said label group. Useful label groups include 

. , 32 35 3 125 

radioisotopes (e.g., P, S, H, I), fluorescent compounds (e.g., 5-bromodesoxyuridin, 
umbelliferone, fluorescein, fluorescein isothiocyanate, rhodarnine, dicWorotriazmylamine fluorescein, 
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dansyl chloride, phycoerythrin acetylarmnofluorene, cHgoxigenin), luminescent compounds (e.g., 
luminol, GFP, luciferin, aequorin), enzymes or enzyme co-factor detectable labels (e.g., peroxidase, 
hiciferase, alkaline phosphatase, galactosidase, or acetylcholinesterase), or compounds that are 
recognized by a secondary factor such as strepavidin, GST, or biotin. Preferably, a label group is 
attached to a polynucleotide or polypeptide in such a way as to not interfere with the biological 
activity of the polynucleotide or polypeptide. 

Radioisotopes may be detected by direct counting of radi ©emission, film exposure, or by 
scintillation counting, for example. Enzymatic labels may be detected by determination of conversion 
of an appropriate substrate to product, usually causing a fluorescent reaction. Fluorescent and 
luminescent compounds and reactions may be detected by, e.g., radioenrission, fluorescent 
microscopy, fluorescent activated cell sorting, or a luminometer. 

As used herein with respect to antibodies, an antibody is said to "selectively bind" or 
"specifically bind" to a target if the antibody recognizes and binds the target of interest but does not 
substantially recognize and bind other molecules in a sample, e.g., a biological sample, which 
includes the target of interest 

As used herein, the term "vector* refers to a nucleic acid molecule capable of transporting 
another nucleic acid to which it has been linked. One type of vector is a "plasnrid", which refers to a 

(*i'rfii1nr rtrtiVMo <z*r*vryAc»A TYNT A 1r*o>« —1 TTVKT A k ^ A _ "U _ V i * *> j 

oi veucoi a, a max vector, wiieceiB aoaiuonai ±>iV/t segments can oe ngatetf into tne viral genome. 
Certain vectors are capable of autonomous replication in a host cell into which they are introduced 
(e.g., bacterial vectors having a bacterial origin of replication and episomal mammalian vectors). 
Other vectors (e.g., non-episomal mammalian vectors) are integrated into the genome of a host cell 
upon introduction into the host cell, and thereby are replicated along with the host genome. Moreover, 
certain vectors are capable of directing the expression of genes to which they are operatively linked. 
Such vectors are referred to herein as "expression vectors". In general, expression vectors of utility in 
recombinant DNA techniques are often in the form of plasrmds. In the present specification, 
"plasmid" and "vector" can be used interchangeably as the plasmid is the most commonly used form 
of vector. However, the invention is intended to include such other forms of expression vectors, such 
as viral vectors (e.g., replication defective retroviruses, adenoviruses and adeno-associated viruses), 
which serve equivalent functions. 

As used herein, "effective amount" describes the amount of an agent, preferably a CPP 
modulator of the invention, sufficient to have a desired effect For example, an anticanfaovascular 
disorder effective amount is the amount of an agent required to reduce a symptom of a cardiovascular 
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disorder in an individual by at least 1, 2, 5, 10, 15, or preferably 25%. The term may also describe the 
amount of an agent required to ameliorate a cardiovascular disorder-caused symptom in an individual. 
Common symptoms of cardiovascular disorders include: chest pressure, heartburn, nausea, vomiting, 
numbness, shortness of breath, heavy cold sweating, unexplained fatigue, and feelings of anxiety. The 
5 more severe symptoms of cardiovascular disorders are chest pain (angina pectoris), rhythm 

disturbances (arrhythmias), stroke, or heart attack. The effective amount for a particular patient may 
vary depending on such factors as the diagnostic method of the symptom being measured, the state of 
die condition being treated, the overall health of the patient, method of administration, and the 
severity of side-effects. 

10 

CPPs of the invention 

The Cardiovascular disorder Plasma Polypeptides (CPPs) of the invention are described in the 
sequence listing as SEQ ID NOs:l-4, 5-12, 13-19 and 20-25. CPPs comprising an amino acid 
sequence selected from one of the groups consisting of SEQ ID NOs: 2-4, 6-12, 13-19 and 21-25 are 
15 secreted and circulate at an elevated level in blood plasma of individuals that have or are at risk of 
developing a cardiovascular disorder. 

Further included CPPs are polypeptides comprising an amino acid sequence selected from one 

emprise addieonalaminoacids from one of the groups ot SEQ ID NO: 1-2, 5-6, 13, and 20-21, 
20 respectively. Such additional amino acids are fused in frame with the selected sequence to form 

contiguous amino acid sequence from the proteins of SEQ ID NO: 1-2, 5-6, 13, and 20-21. 

Interestingly, levels of the CPPs of the invention are increased in the plasma of individuals 

suffering from cardiovascular disorders. As such, the CPPs of the invention provide a useful 

diagnostic tool, wherein an increased level of a CPP indicates an increased risk of developing, or the 
25 presence o£ a cardiovascular disorder. Further, CPPs are useful for drug design and in therapeutic 

strategies for prevention and treatment of cardiovascular disorders. 

CPP 12 (SEQ ID NO:2) is a previously unreported plasma form of fee antimicrobial peptide 
Defensin 5. Interestingly, the level of the CPPs of the invention is dramatically increased (46.2-fold) 
30 in the plasma of individuals suffering from cardiovascular disorders. As such, the CPPs of the 

invention provide a useful diagnostic tool, wherein an increased level of a CPP indicates an increased 
risk of developing, or the presence of, a cardiovascular disorder. Further, CPPs are useful for drug 
design and in therapeutic strategies for prevention and treatment of cardiovascular disorders. 
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Defensins are secreted as small peptides with antimicrobial properties. The most extensively 
characterized members of this family are defensms 1-4. Defensins 1-4 were cloned from a leukemia 
cell library, and expression was found in normal bone marrow cells and circulating lymphocytes 
(Daher, et al., PNAS. 85 :7327-733 1 (1988)). These proteins compose as much as 5% of the cellular 
protein of neutrophils. Defensins are processed to form peptides of approximately 35 amino acids that 
are released in response to microbial and viral infection. The small cationic peptides insert and disrupt 
prokaryotic membranes (Hill, et al. Science 251:1481-1485 (1991)). In addition, defensins reduce 
HIV replication in response to the CD8 T cell factor CAF (Zhang, et al. Science 298:995-1000 
(2002)). 

Defensin 5 (Def5) was recently cloned from Paneth gut epithelial cells and localized to 
Paneth cell granules. Unlike the other defensin proteins, Def5 is stored in secretory vesicles as a 
propeptide and is released into the intestinal lumen (Jones and Bevins, JBC 267:23216-25 (1992)). 
Def5 expression is the first described for an antimicrobial peptide in an epithelial cell and it is 
believed to play an important part in host defense in the gut Def5 expression has since been described 
in the female reproductive tract (Svinorich, et aL, Am J Obstet Gynecol (1997) 176:470-5). Def5 
peptides isolated from the intestinal lumen comprise proteolyticaUy processed forms of amino acids 
56-94 or 63-94. These represent cleavage at Arg55 and Arg62 sites, respectively, consistent with 

CPP 15 (SEQ ID NO:6) is a plasma form of the complement lysis inhibitor, CD59. 
Interestingly, die level of the CPPs of the invention is dramatically increased (43-fold) in the plasma 
of individuals suffering from cardiovascular disorders. As such, the CPPs of the invention provide a 
useful diagnostic tool, wherein an increased level of a CPP indicates an increased risk of developing, 
or the presence of, a cardiovascular disorder. Further, CPPs are useful for drug design and in 
therapeutic strategies for prevention and treatment of cardiovascular disorders. 

CD59 was originally detected on the surface of red blood cells and lymphoid cells and 
subsequently cloned (Davies, et al., J. Exp. Med. 170:637-654 (1989)). The 128 amino acid precursor 
protein has a 25 amino acid signal sequence and a C terminal prosequence of 26 amino acids. Asnl02 
is attached to the cell membrane by a GPI anchor. CD59 inhibits complement-mediated lysis by 
disrupting polymerization of the Membrane Attack Complex (MAC) component C9. WO9302188 
discloses that CD59 protects endothelial cells from lysis. Inhibition of CD59 by glycation increases 
the effects of MAC formation. Genetic reduction of CD59 results in blood cell-deficiency disorders, 
such as anemia, while its complete absence results in non-Hodgkin's lymphoma (Meletis, et al., 
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Hematol J. 2:33-7 (2001)). CD59 may be used to prevent human host rejection of transplanted cells 
(US 5573940). 

CPP 16 (SEQ ID NO: 13) is a previously un described plasma form of flavin reductase. 
5 Interestingly, the level of the CPPs of the invention is dramatically increased (nearly 10-fold) in the 
plasma of individuals suffering from cardiovascular disorders. As such, die CPPs of the invention 
provide a useful diagnostic tool, wherein an increased level of a CPP indicates an increased risk of 
developing, or the presence of, a cardiovascular disorder. Further, CPPs are useful for drug design 
and in therapeutic strategies for prevention and treatment of cardiovascular disorders. 

10 Flavin reductase was sequenced from liver cytosolic fractions and found to convert bilirubin 

to biliverdin (Yamaguchi, et aL Biochem Biophys Res Commun 197:1518-23 (1993)). The 206 amine 
acid protein is predominantly expressed in the liver and erythrocytes, and at lower levels in the heart, 
lung, adrenal gland, and cerebrum. The enzyme normally catalyzes electron transfer from reduced 
pyridine nucleotides to flavins (Chicuba, et al., Biochem. Biophys. Res. Commun. 198 :1 170-1 176 

15 (1994)). Flavin reductase plays a role in iron metabolism and preventing oxidative damage. The 
protein has been found excreted in urine (Hirota Biol Pharm Bull. 18:481-4(1995)), however, a 
plasma form has not been described. Elevated levels of flavin reductase have been discovered in the 

- , - ■ — .-r- _i,_t r> , — a tv--.- - f ^r> 

Circulation ot lius protein in the plasma is uaexpecieu ana, as suen, ts an important indicator oi 
20 disease. 

CPP 18 is a previously uncharacterized plasma form of the Microfibril Associated 
GrycoProtein-2 (MAGP-2). Ihterestingjy, the level of the CPPs of the invention is increased in the 
plasma of individuals suffering from cardiovascular disorders. As such, the CPPs of the invention 

25 provide a useful diagnostic tool, wherein an increased level of a CPP indicates an increased risk of 
developing, or the presence of, a cardiovascular disorder. Further, CPPs are useful for drug design 
and in therapeutic strategies for prevention and treatment of cardiovascular disorders. 

Microfibril-Associated Glycoprotein (MAGP)-2 was originally detected in ligament tissue 
associated with microfibrils and subsequently cloned from a genomic library (Gibson, et al., JBC 

30 271:1096-1 103 (1996)). The N terminus of MAGP-2 has an integrin-binding RGD motif, and 
mediates cell-cell interactions. It is associated with both elastin-associated and elastin-free 
microfibrils but is absent from the microfibrils of the ciliary zonule and the connective tissue around 
kidney tubules. Penner, et al. (JBC 27735044-9 (2002)) demonstrated that the C terminus of MAGP- 
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2 is associated with fibrillins 1 and 2 at a specific EGF-like repeat domain in these proteins. These 
data suggest that MAGP-2 plays a role in microfibril assembly and formation of the extracellular 
matrix. MAGP-2 is expressed in ligament, dermis, vesicular cells of the lung and heart, and 
perimysium (muscle, bone), and represents a crucial structural component of these tissues. 

The terms "Cardiovascular disorder Plasma Polypeptide" and "CPP" are used herein to 
embrace any and all of the peptides, polypeptides and proteins of the present invention. Also forming 
part of the invention are polypeptides encoded by the polynucleotides of the invention, as well as 
fusion polypeptides comprising such polypeptides. The invention embodies CPPs from humans, 
including isolated or purified CPPs consisting of, consisting essentially of, or c ompr is in g an amino 
acid sequence selected from one of the groups consisting of SEQ ID NOs: 1-4, 5-12, 13-19 and 20-25. 
Further included are unmodified precursors, proteolytic precursors and intermediates of the sequence 
selected from one of the groups consisting of SEQ ID NOs: 1-4, 5-12, 13-19 and 20-25. 

The present invention embodies isolated, purified, and recombinant polypeptides comprising 
a contiguous span of at least 3 amino acids, preferably at least 8 to lOamino acids, with a CPP 
biological activity. In preferred embodiments the contiguous stretch of amino acids comprises the site 
of a mutation or functional mutation, including a deletion, addition, swap or truncation of the amino 

nucleotide sequences of the invention, or a complementary sequence thereof or a fragment thereof. 

One aspect of the invention pertains to isolated CPPs, and biologically active portions thereof, 
as well as polypeptide fragments suitable for use as immunogens to raise anti-CPP antibodies, m one 
embodiment, native CPP peptides can be isolated from plasma, cells or tissue sources by an 
appropriate purification scheme using standard protein purification techniques. In another 
embodiment, CPPs are produced by recombinant DNA techniques. Alternative to recombinant 
expression, a CPP can be synthesized chemically using peptide synthesis techniques, as described in 
the section titled "Chemical Manufacture of CPP compositions" and in Example 2. 

Typically, biologically active portions comprise a domain or motif with at least one activity of a 
CPP. A biologically active CPP may, for example, comprise at least 1, 2, 3, or 5 amino acid changes 
from the sequence selected from one of the groups consisting of SEQ ID NOs: 1-4, 5-12, 13-19 and 
20-25, or comprise at least 1%, 2%, 3%, 5%, 8%, 10% or 15% change in amino acids from the 
sequence selected from one of the groups consisting of SEQ ID NOs: 1-4, 5-12, 13-19 and 20-25. 
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Characterization of CPPs 

The polypeptides of the invention, CPPs, are defined by the tryptic peptides of SEQ ID NOs: 
3-4, 7-12, 14-19 and 22-25 (Figures and Table 1). These peptides were isolated from the plasma of 
Coronary Artery Disease patients and characterized according to the MicroProtTM method, as 
5 described in Example 1 . SEQ ID NOs:2, 6, 13 and 21 represent the polypeptide species found at an 
elevated level in CAD plasma from which the tryptic peptides were released. 

The CPPs of the invention are all less than or around 20kD in molecular weight, as the plasma 
sample is first separated based on molecular weight Higher molecular weight polypeptide species are 
separated and characterized by a different method As described in Example 1 , the plasma sample is 
10 subjected to a number of chromatography separations. Details about these chromatography methods 
are given in Example 1. 

The first separation is on a cation exchange chromatography column, which is eluted with 
increasing salt concentration. Eighteen fractions are collected. The CEX column in Table 1 lists 
which fractions contained each tryptic peptide, as well as its elution conditions. Separation by cation 
15 exchange provides an indication of the overall positive charge of a polypeptide species. Cation 

exchange is followed by a reverse phase HPLC separation. The RP1 column in Table 1 lists in which 
of the 30 fractions each tryptic peptide ehited, as well as its elution conditions. Separation by reverse 

-■«- - -' J * ^ ■ — *4 11 v..3^^t,-v:-^. ,r ^4 ^ , — TVc i~ ~f f^-^. 

digits oi the column labeled ftun ivumoer indicate wrncn or oie z<* eiaiea tractions aom tne secona 
20 reverse phase HPLC separation contained the tryptic peptides (see Example 1), Proteome indicates in 
which of the sample each tryptic peptide was observed. Olav scores reflect, among other things, the 
strength of the experimental MS-MS signal over noise, as detected by the MS-MS data identification 
software, and thus give an indication of the protein concentration in the sample. 

The ratio of protein levels in CAD versus control plasma samples is calculated by two 
25 methods- The first method calculates the CAD/Control ratio by the number of fractions from each 
sample containing the CPPs. The result is 7 for CPP 12, 14 for CPP 15, 4 for CPP 16, and 338 for 
CPP 18 (see Table 1). Alternatively, and more accurately, the Olav scores obtained for each peptide in 
the mass spectrometry data analysis software are used to give a weighted ratio. This result is 46.2 for 
CPP 12, 43. 1 for CPP 15, 9.58 for CPP 16, and 336 for CPP 18, indicating that CPP 12, for example, 
30 is present at 46.2 times the level in CAD plasma compared to control plasma. As such, the CPPs 
provide a useful diagnostic tool, wherein an increased level of a CPP indicates an increased risk of 
developing, or the presence o£ a cardiovascular disorder. 
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PEPTIDE SEQUENCE 


CEX 


Salt 


RP1 


%B 


Run number 


Proteome 


Olav score 


QSGEDNQDLAISFAGNGLSALR 


10 


175 


8 


33.65 


110332J58 


Control 


8.7 


QSGEDNQDLAfSFAGNGLSALR 


10 


175 


8 


33.65 


89854J09 


CAD 


37.3 


QSGEDNQDLAISFAGNGLSALR 




175 


8 


33.65 


88265J09 


CAD 

• 


66.16 


QSGEDNQDLAISFAGNGLSALR 




175 


8 


33.65 


88265_09 


CAD 


70.2 


ESLSGVCBSGR 




175 


8 


33.65 


88265_09 


CAD 


35.35 


QSGEDNQDLAISFAGNGLSALR 




175 


9 


35.7 


88277_09 


CAD 


85.61 


QSGEDNQDLAISFAGNGLSALR 




175 


10 


37.5 


88205_07 


CAD 


60.47 


ESLSGVCEISGR 




175 


11 


39.42 


88221_07 


CAD 


46.73 



Number of Control Fractions: 1 
Number of CAD Fractions: 7 
CAD/ Control: 7 

Olav Score Control: 8.7 
Olav Score CAD: 401.82 

It is interesting to note that the tryptic peptide of SEQ ID:3, which is the main peptide 
detected, belongs to the propeptide part of the protein (Ghosh, et al., Nat Immunol. (2002) 3:583-90). 
The increase in disease of this propeptide therefore most probably reflects an incomplete processing 
of the propeptide into the Defensin 5 peptide. 
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Table lb - CPP 15 



Table lb -CPP 15 
















Peptide Sequence 


CEX 


Salt 


RP1 


%B 


Run Number 


P rote o me 


Olav Score 


LQCYNCPNPTADCK 


3 


75 


10 


37.5 


110084J08 


Control 


27.76 


LQCYNCPNPTADCK 


6 


100 


9 


35.7 


130820.14 


Contro) , 


28.59 


FEHCN FNDVTTR 






10 


37.5 


130832_08 


Control 


19.67 


LQCYNCPNPTADCK 






7 


31.73 


8421 8.24 


CAD 


50.85 


LQCYNCPNPTADCK 










84158.15 


CAD 


55.7 


LQCYNCPNPTADCK 










B4158_17 


CAD 


60.92 


LQCYNCPNPTADCK 










84158.18 


CAD 


13.28 


AGLQVYNK 






8 


33.65 


84158.15 


CAD 


37.18 


LRENELTYYCCK 






84158.16 


CAD 


44.24 


LRE N ELTYYCC K 










84158.17 


CAD 


63.42 


LRENELTYYCCK 










84158.18 


CAD 


50.36 


LRENELTYYCCK 


2 


75 






84158_16 


CAD 


78.01 


LQCYNCPNPTADCK 








■ 


84246_17 


CAD 


16.91 


FEHCNFNDVTTR 










84246_1 7 


CAD 


49.08 


LRENELTYYCCK 






9 


35.7 


84246.16 


CAD 


30.38 


LRENELTYYCCK 


- 


- 


84246.17 


CAD 


14X>4 


EN ELTYYCC K 










84246.17 


CAD 


10.42 


LQCYNCPNPTADCK 






10 

• 


37.5 


130075.12 


CAD 


61.01 


LRENELTYYCCK 






130075.12 


CAD 


29.32 


LQCYNCPNPTADCK 










84346.16 


CAD 


43.81 


LQCYNCPNPTADCK 










84346.15 


CAD 


41.83 


LQCYNCPNPTADCK 






9 


35.7 


84346.15 


CAD 


28.71 


LQCYNCPNPTADCK 








84346.16 


CAD 


51.82 


FEHCNFNDVTTR 








■ 


84346.15 


CAD 


11.38 


FEHCNFNDVTTR 










84346.16 


CAD 


50.65 


LQCYNCPNPTADCK 


3 


75 


: 




84274.11 


CAD 


54.83 


LQCYNCPNPTADCK 










84274.11 


CAD 


30.68 


LQCYNCPNPTADCK 

i 










84274.09 


CAD 


77.1 


LQCYNCPNPTADCK 






10 


37.5 


84274.10 


CAD 


61.32 


LQCYNCPNPTADCK 










84274.16 


CAD 


16.69 


AGLQVYNK 










84274.09 


CAD 


28.14 


AGLQVYNK 








- 


84274.1 1 


CAD 


15.26 
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Table Ib-CPP 15 
















Peptide Sequence 


CEX 


Salt 


RP1 


%B 


Run Number 


Proteome 


Olav Score 


FEHCNFNDVTTR 










84274.1 1 


CAD 


47.52 


FEHCN FNDVTTR 










84274.10 


CAD 


50.86 


FEHCNFNDVTTR 










84274.10 


CAD 


41.6 


LRENELTYYCCK 










84274_09 .. 


CAD 


55.97 


LRENELTYYCCK 










84274.10 


CAD 


50.1 


LRENELTYYCCK 






10 


37.5 


84274.13 


CAD 


15.93 


LRENELTYYCCK 










84274_10 


CAD 


49.2 


LRENELTYYCCK 










84274.1 1 


CAD 


40.94 


LRENELTYYCCK 










84274.1 1 


CAD 


58.5 


LRENELTYYCCK 










84274.1 1 


CAD 


49.41 


LRENELTYYCCK 








• 


84274.09 


CAD 


58.43 


LQCYNCPNPTADCK 


3 


75 






84286.09 


CAD 


46.55 


LQCYNCPNPTADCK 








84286.10 


CAD 


48.82 


LQCYNCPNPTADCK 










84286_1 1 


CAD 


78.52 


LQCYNCPNPTADCK 










84286.10 


CAD 


55.56 


FEHCNFNDVTTR 






11 


39.42 


84286.09 


CAD 


45.93 


FEHCNFNDV 1 1 R 






84286.10 


CAD 


61.35 


rr:ur(acwr>\nTO 










Q/<o«e? -in 


r&n 


















■ * - — ' — « 


LRENELTYYCCK 










84286.11 


CAD 


38.65 


LRENELTYYCCK 










84286.09 


CAD 


64.74 


LQCYNCPNPTADCK 






12 


41.34 


84302.07 


CAD 


62.31 


LQCYNCPNPTADCK 






13 


43.27 


84318.08 


CAD 


40.76 


LQCYNCPNPTADCK 






84318.07 


CAD 


47.09 



26 



WO 2004/106941 



PCT/EP2004/005786 



Table 1b -CPP 15 
















Peptide Sequence 


CEX 


Salt 


RP1 


% B 


Run Number | 


Proteom© 


Olav Score 


LQCYNCPN PTADCK 










88181.14 


CAD 


60.51 


LQCYNCPNPTADCK 










88181.15 


CAD 


47.12 


LQCYNCPNPTADCK 










88181.14 


CAD 


51.75 


FEHCNFNDVTTR 






9 


35.7 


88181.14 


CAD 


47.55 


PEHCNFNDVTTR 








• 


88181.15 


CAD 


27.86 


LREN ELTYYCCK 










88181.15 


CAD 


59.21 


LRENELTYYCCK 


4 


75 






88181.14 


CAD 


58.97 


LQCYNCPNPTADCK 








88113.09 


CAD 


65.16 


LQCYNCPNPTADCK 










88113.10 


CAD 


63.32 


LQCYNCPNPTADCK 










88113.10 


CAD 


46.5 


reHCNFNDVTTR 






10 


37.5 


88113.09 


CAD 


58.76 


FEHCNFNDVTTR 










88113.11 


CAD 


51.97 


FEHCNFNDVTTR 










88113.10 


CAD 


56.4 


LRENELTYYCCK 










88113.10 


CAD 


71.05 


AGLQVYNK 


5 


75 


11 


39.42 


101105.10 


CAD 


20.74 


FEHCNFNDVTTR 




101105.10 


CAD 


32.73 



















Number of Control Fractions: 3 
Number of CAD Fractions: 72 
CAD/ Control: 14 

Control Olav Score: 76.02 
CAD Olav Score: 3276.65 
CAD/ Control: 43.1 
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Table lc-CPP 16 



Peptide Sequence 


CEX 


Salt 


RP1 


%B 


Run Number 


Proteome 


Olav Score 


NDLSPTTVMSEGAR 


10 


175 


19 


54.8 


110496.07 


Control 


19.93 


HDLGHFMLR 


11 


175 


14 


45.2 


1 1 0432_09 


Control 


8.89 


TVAGQDAVIVLLGTR 










89882_10 


CAD 


11.58 


WAVMPPHIGDQPLTGAYTVTLDGR 






15 


47.1 


89882JI 1 


CAD 


23.91 


CLTTOEYDGHSTYPSHQYQ 


10 


175 




■ 


89882_12 


CAD 


59.47 


LQAVTDDHIR 






8791 7_09 


CAD 


16.58 


WAVMPPHIGDQPLTGAYTVTLDGR 






16 


49.02 


8791 7 J)9 


CAD 


37.81 


CLTTDEYDGHSTYPSHQYQ 










8791 7_Q9 


CAD 


48.4 


TVAGQDAVIVLLGTR 


11 


175 


16 


49.02 


8971 0_09 


CAD 


38.93 


NDLSPTTVMSEGAR 


13 


175 


17 


50.95 


92124_10 


CAD 


39.54 



Number of Control Fractions: 2 
5 Number of CAD Fractions: 8 
CAD/ Control: 4 

■Uia v c&oie <^tllv. q.z^ 
10 CAD/ Control: 9.58 



28 



WO 2004/106941 PCT/EP2004/005786 



Table Id -CPP 18 



Dtt*\4t#f a Qatfii iaii t*o 

repuuB o«» (4 uei iw 


UCA 


vol 1 


dm 


/O D 




■ * UWUIIlO 


wiav ocore 


LKPCfcN V ULUKr v* o L 






6 


29.8 


41010(1 11 




Oft 7C 


CKIVCD4 DD^CM\/ni ODDM^I 






f 


11 71 


44f\14fi 11 

1 iuoio_io 




ca no 
54.02 


UCInULOr 1 oLK 


11 
1 1 


17*; 

1 I o 






1 lUooU_l r 


v»onurOi 


54.47 


QClnQLCr 1 oLK 






1 1 


1Q, AO 


Pinion 4ft 


control 




UUInULUr 1 £rl_r\ 










1 1 UOoU__ 1 0 




CO 4 

OO. 1 


» ODPCKft/TM ADDMCt 

LJ^rOtTOVULUKKNoL 






f 


14 "71 

ol.ro 


4 04 OCC 4 0 

121055_12 


oontroi 


oe ire 
OO.So 


LYSVHRPVK 


12 


175 


11 


39.42 


1211G7_11 


Control 


40.83 


ppit_jp<i pctc? o 
uCinULCr 1 oL-K 






A"7 4 4 

121 1U/_/l 1 


UOhuOJ 


00.47 


opiuph pcTCi o 










oyouo_1 / 


PAH 

CAD 


CO T7 

58.77 


PPlVJPil D 
Wv/JrlwLOr 1 oLK 










oyoUo_ i / 


LAD 


4 c on 

15.39 




10 


175 


11 


39.42 


oyouo^i y 


CAD 


OO R4 

22.91 


vJOlnvJLL/r 1 OL.K 


■oycHio__io 


CAD 


y|G OO 

46.92 


/■"IPlUpjl rCTC1 15 








■ 


RORAA 4 ft 


CAD 


52.49 


UvlnULLrr 1 Oi_r\ 










onona *c 

oyoup^io 


par 
CAD 


Af\ EC 


p*piurn rFTQI D 

Uvl nULUr 1 OL>r\ 










Oo2UO_ 14 


pah 
CAD 


24.27 


PiPii-im rcTQi d 

WOlrlUL.Cr 1 OLX\ 






in 


o r .*j 


OQOAe 4C 

oo2UO__io 




ftA 1 

bU.o 


npiuiPji rcTCi d 
ULyinLJLOr I oU\ 










O02UD_l 5 


par 

| CAD 


55.51 


<"VPIU01 r^CTCI D 

WiviriULUr 1 5>LK 










oo221_13 


CAD 


^ >* 00 
O4^o 


(VILini pcttci D 








■ 


ot>221_12 


P A l™\ 

CAD 


CO OT 

52.37 


UCInULCr 1 oLK 


1 1 


17ft 
I/O 






oo221_2U 


CAD 


4o.o4 


\JClnULCr- 1 oLK 






11 


39.42 


oo221_14 


CAD 


CO A 

53.1 


PlPILlr"^! PCTQI D 

LiUlnvJLCr 1 oLK 






QQ0O4 4ft 

oo221_1b 


CAD 


cc c 


UCInULOr ToLK 










Oo221_ 1 7 


p A 

CAD 


55.12 


woinwLor i oi_T\ 










AA991 1Q 


PAn 


fto T7 
DO. f r 


QCIHQLCFTSLR 










88221_15 


CAD 


36.83 


QCIHQLCFTSLR 










8B221JI6 


CAD 


19.28 


SNYFRLPPCENVDLQRPNGL 










101017J1 


CAD 


54.96 


ShlYFRLPPCENVDLQRPNGL 






7 


31.73 


101017JI0 


CAD 


82.4 


SNYFRLPPCENVDLQRPNGL 


12 


175 




101017_10 


CAD 


77.51 


SNYFRLPPCENVDLQRPNGL 






101017_09 


CAD 


49.18 


QCIHQLCFTSLR 






10 


37.5 


100965 JO 


CAD 


52.98 


QCIHQLCFTSLR 






100965_14 


CAD 


11.32 


SNYFRLPPCENVDLQRPNGL 


13 


175 


7 


31.73 


89746_15 


CAD 


42.31 


LPPCENVDLQRPNGL 




89746_15 


CAD j 


27.71 
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Number of Control Fractions:8 
Number of CAD Fractions:27 
CAD/ Control: 3.38 

Olav Score Control: 3753 
Olav Score CAD: 1260.34 
CAD/ Control: 3.36 

It is interesting to note that CEP 18, as detected by the methods described in Example 1, 
comprise at least two distinct species. The first one elutes in RP1 fractions 6 and 7, and contains the 
following tryptic peptides: SNYFRLPPCENVDLQRPNGL and LPPCENVDLQRPNGL. Hie second one 
elutes in RP1 fractions 10 and 1 1, and is therefore more hydrophobic, and contains only the 
following, different, tryptic peptides: QCIHQLCFTSLR and LYSVHRPVK. 

CPP nucleic acids 

One aspect of the invention pertains to purified or isolated nucleic acid molecules that encode 

f^TiT> c orWftV^collv ort+^trA -nrYrfifMte «-V»<*rp.fvf oc r1f»crrihfvl Vtprefn well rnirlpic r^fviH 

diagnostic methods and in drug screening assays as further described herein. 

An object of the invention is a purified, isolated, or recombinant nucleic acid coding for a CPP, 
complementary sequences thereto, and fragments thereof. The invention also pertains to a purified or 
isolated nucleic acid comprising a polynucleotide having at least 95% nucleotide identity with a 
polynucleotide coding for a CPP, advantageously 99 % nucleotide identity, preferably 99.5% 
nucleotide identity and most preferably 99.8% nucleotide identity with a polynucleotide coding for a 
CPP, or a sequence complementary thereto or a biologically active fragment thereof. Another object 
of the invention relates to purified, isolated or recombinant nucleic acids comprising a polynucleotide 
that hybridizes, under the stringent hybridization conditions defined herein, with a polynucleotide 
coding for a CPP, or a sequence complementary thereto or a variant thereof or a biologically active 
fragment thereof. 

hi another preferred aspect, the invention pertains to purified or isolated nucleic acid molecules 
that encode a portion or variant of a CPP, wherein the portion or variant displays a CPP biological 
activity. Preferably said portion or variant is a portion or variant of a naturally occurring CPP or 
precursor thereof. 
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Another object of the invention is a purified, isolated, or recombinant nucleic acid encoding a 
CPP comprising, consisting essentially o£ or consisting of the amino acid sequence selected from one 
of the groups of SEQ ID NOs: 1-4, 5-12, 13-19 and 20-25, or fragments thereof, wherein the isolated 
nucleic acid molecule encodes one or more motifs such as, for CPP 12: a bacterial endotoxin binding 
site or a disulfide bond; for CPP 15: a target binding site (e.g., complement C9), a GPI-anchor site, or 
a disulfide bond; for CPP 1 6: a target binding site (e.g., NAD/ NADP-binding); and for CPP 1 8: an 
EGF-like repeat domain, an RGD motif, or a disulfide bond. 

The nucleotide sequence determined from the cloning of the CPP-encoding gene allows for 
the generation of probes and primers designed for use in identifying and/or cloning other CPPs (e.g. 
sharing the novel functional domains), as well as CPP homologues from other species. 

A nucleic acid fragment encoding a "biologically active portion of a CPP" can be prepared by 
isolating a portion of a nucleotide sequence coding for a CPP, which encodes a polypeptide having a 
CPP biological activity, expressing the encoded portion of the CPP (e.g., by recombinant expression 
in vitro or in vivo) and assessing the activity of the encoded portion of the CPP. 

The invention further encompasses nucleic acid molecules that differ from the CPP nucleotide 
sequences of the invention due to degeneracy of the genetic code and encode the same CPPs of the 
invention. 

slotted in the art that UN A sequence polymorphisms tnat ieaa co cnanges in ine amino aciu sequence 
of the CPPs may exist within a population (e.g., the human population). Such genetic polymorphism 
may exist among individuals within a population due to natural allelic variation. Such natural allelic 
variations can typically result in 1-5% variance in the nucleotide sequence of a CPP-encoding gene or 
nucleic acid sequence. 

Nucleic acid molecules corresponding to natural allelic variants and homologues of the CPP 
nucleic acids of the invention can be isolated based on their homology to the CPP nucleic acids 
disclosed herein using the cDNAs disclosed herein, or a portion thereof, as a hybridization probe 
according to standard hybridization techniques under stringent hybridization conditions. 

It will be appreciated that the invention comprises polypeptides having an amino acid 
sequence encoded by any of the polynucleotides of the invention. 

Uses of CPP nucleic acids 

Polynucleotide sequences (or the complements thereof) encoding CPPs have various 
applications, including uses as hybridization probes, in chromosome and gene mapping, and in the 



« 
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generation of antisense RNA and DNA. In addition, CPP-encoding nucleic acids are useful as targets 
for pharmaceutical intervention, e.g. for the development of DNA vaccines, and for the preparation of 
CPPs by recombinant techniques, as described herein. The polynucleotides described herein, 
including sequence variants thereof, can be used in diagnostic assays. Accordingly, diagnostic 
methods based on detecting the presence of such polynucleotides in body fluids or tissue samples are 
a feature of the present invention. Examples of nucleic acid based diagnostic assays in accordance 
with the present invention include, but are not limited to, hybridization assays, e.g., in situ 
hybridization, and PCR-based assays. Polynucleotides, including extended length polynucleotides, 
sequence variants and fragments thereof as described herein, may be used to generate hybridization 
probes or PCR primers for use in such assays. Such probes and primers will be capable of detecting 
polynucleotide sequences, including genomic sequences that are similar, or complementary to, the 
CPP polynucleotides described herein. 

The invention includes primer pairs for carrying out a PCR to amplify a segment of a 
polynucleotide of the invention. Each primer of a pair is an oligonucleotide having a length of 
between 15 and 30 nucleotides such that i) one primer of the pair forms a perfectly matched duplex 
with one strand of a polynucleotide of the invention and the other primer of the pair form a perfectly 
match duplex with the complementary strand of the same polynucleotide, and ii) the primers of a pair 

~- • 1 r—« o»Vs ■y^<±.-r£'o.~L<\'\* ^^rvA^I A An — .1 r~.-.r s^r, - -4-o»- ~»— +V -» — v -.1-,-*-,^^ nr *.4~Z Ale* « ~-rr~ J r ~ A-~±^^~ ~o C 

Oetween iu ana^juu nucieoaaes. iTeieraoiy, tne annealing tempera tore or eacn pnmer oi a pan* to 
its respective complementary sequence is substantially the same. 

Hybridization probes derived from polynucleotides of the invention can be used, for example, 
in performing in situ hybridization on tissue samples, such as fixed or frozen tissue sections prepared 
on microscopic slides or suspended cells. Briefly, a labeled DNA or RNA probe is allowed to bind its 
DNA or RNA target sample in the tissue section on a prepared microscopic, under controlled 
conditions. Generally, dsDNA probes consisting of the DNA of interest cloned into a plasmid or 
bacteriophage DNA vector are used for this purpose, although ssDNA or ssRNA probes may also be 
used. Probes are generally oligonucleotides between about 15 and 40 nucleotides in length. 
Alternatively, the probes can be polynucleotide probes generated by PCR random priming primer 
extension or in vitro transcription of RNA from plasmids (riboprobes). These latter probes are 
typically several hundred base pairs in length. The probes can be labeled by amy of a number of label 
groups and the particular detection method will correspond to the type of label utilized on the probe 
(e.g~, autoradiography, X-ray detection, fluorescent or visual microscopic analysis, as appropriate). 
The reaction can be further amplified in situ using immunocytochemical techniques directed against 
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the label of the detector molecule used, such as an antibody directed to a fluorescein moiety present 
on a fluorescentfy labeled probe. Specific labeling and in situ detection methods can be found, for 
example, in Howard, G. C, Ed., Methods in Nonradioactive Detection, Appleton & Lange, Norwalk, 
Conn., (1993), herein incorporated by reference. 

Hybridization probes and PCR primers may also be selected from the genomic sequences 
corresponding to the full-length proteins identified in accordance with the present invention, including 
promoter, enhancer elements and introns of the gene encoding the naturally occurring polypeptide. 
Nucleotide sequences encoding a CPP can also be used to construct hybridization probes for mapping 
the gene encoding that CPP and for the genetic analysis of individuals. Individuals carrying variations 
of, or mutations in the gene encoding a CPP of the present invention may be detected at the DNA 
level by a variety of techniques. Nucleic acids used for diagnosis may be obtained from a patient's 
cells, including, for example, tissue biopsy and autopsy material. Genomic DNA may be used directly 
for detection or may be amplified enzymatically by using PCR (SaiM, et al. Nature 324: 163-1 66 
(1986)) prior to analysis. RNA or cDNA may also be used for the same purpose. As an example, PCR 
primers complementary to the nucleic acid of the present invention can be used to identify and 
analyze mutations in the gene of the present invention. Deletions and insertions can be detected by a 
change in size of the amplified product in comparison to the normal genotype. Point mutations can be 



radiolabeled annsense DiNA sequences ot the invention. Sequence cnanges at specific locations may 
also be revealed by nuclease protection assays, such as RNase and SI protection or the chemical 
cleavage method (e.g. Cotton, et al., Proa Natl. Acad. Sci. USA 85:4397-4401 (1985)), or by 
differences in melting temperatures. "Molecular beacons" (Kostrikis L. G. et al., Science 279:1228- 
1229 (1998)), hairpin-shaped, single-stranded synthetic oligonucleotides containing probe sequences 
which are complementary to the nucleic acid of the present invention, may also be used to detect point 
mutations or other sequence changes as well as monitor expression levels of CPPs. 



Oligonucleotide and Antisense Compounds 

Oligonucleotides of the invention, including PCR primers and antisense compounds, are 
synthesized by conventional means on a commercially available automated DNA synthesizer, e.g. an 
30 Applied Biosystems (Foster City, CA) model 380B, 392 or 394 DNA/RNA synthesizer, or like 

instrument Preferably, phosphoramidite chemistry is employed, e.g. as disclosed in the following 
references: Beaucage and Iyer, Tetrahedron, 48: 2223-231 1 (1992); Molko et al, U.S. patent 
4,980,460; Koster et al, U.S. patent 4,725,677; Caruniers et al, U.S. patents 4,415,732; 4,458,066; and 
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4,973,679; and the like. For therapeutic use, nuclease resistant backbones are preferred. Many types 
of modified oligonucleotides are available that confer nuclease resistance, e.g. phosphorothioate, 
phosphorodithioate, phosphoiamidate, or die like, described in many references, e.g. 
phosphorothioates: Stec et al, U.S. patent 5,151,510; Hirschbein, U.S. patent 5,166,387; Bergot, U.S. 
5 patent 5,183,885; phosphoramidates: Froehler et al, International application PCT/US90/O3138; and 
for a review of additional applicable chemistries: Uhlmann and Peyman (cited above). Hie length of 
the antisense oligonucleotides has to be sufficiently large to ensure that specific binding will take 
place only at the desired target polynucleotide and not at other fortuitous sites. The upper range of 
the length is determined by several factors, including the inconvenience and expense of synthesizing 
10 and purifying oligomers greater than about 30-40 nucleotides in length, the greater tolerance of longer 
oligonucleotides for mismatches than shorter oligonucleotides, and the like. Preferably, the antisense 
oligonucleotides of the invention have lengths in the range of about 15 to 40 nucleotides. More 
preferably, the oligonucleotide moieties have lengths in the range of about 18 to 25 nucleotides. 
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Primers and probes 

Primers and probes of die invention can be prepared by any suitable method, including, for 
example, cloning and restriction of appropriate sequences and direct chemical synthesis by a method 



ph os ph od i e st e r method of Brown BL et al (Methods iaizymoi 1979;68:109-151), the 
diethylphosphoramidite method of Beaucage et al (Tetrahedron Lett 1981, 22: 1859-1862) and the 
solid support method described in EP 0 707 592, the disclosures of which are incorporated herein by 
reference in their entireties. 

Detection probes are generally nucleic acid sequences or uncharged nucleic acid analogs such 
as, for example peptide nucleic acids which are disclosed in International Patent Application WO 
92/20702, morpholino analogs which are described in U.S. Patents Numbered 5,185,444; 5,034,506 
and 5,142,047. If desired, the probe may be rendered "non-extendable" in that additional dNTPs 
cannot be added to the probe. In and of themselves analogs usually are non-extendable and nucleic 
acid probes can be rendered non-extendable by modifying die 3 f end of the probe such that the 
hydroxyl group is no longer capable of participating in elongation. For example, the 3' end of the 
probe can be functionalized with the capture or detection label to thereby consume or otherwise block 
the hydroxyl group. 

Any of die polynucleotides of the present invention can be labeled, if desired, by incorporating 
any label group known in the art to be detectable by spectroscopic, photochemical, biochemical, 
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immunochemical, or chemical means. Additional examples include nonradioactive labeling of 
nucleic acid fragments as described in Urdea et al. (Nucleic Acids Research. 1 1 :4937-4957, 1988) or 
Sanchez-Pescador et al. (J. Clin. Microbiol. 26(10): 1934- 193 8, 1988). fa addition, the probes 
according to the present invention may have structural characteristics such that they allow the signal 
amplification, such structural characteristics being, for example, branched DNA probes as those 
described by Urdea et al (Nucleic Acids Symp. Sen 24: 197-200, 1991) or in the European patent No. 
EP 0225807 (Chiron). 

A label can also be used to capture the primer, so as to facilitate fee immobilization of either 
the primer or a primer extension product, such as amplified DNA, on a solid support. A capture label 
is attached to the primers or probes and can be a specific binding member which forms a binding pair 
with the solid's phase reagent's specific binding member (e.g. biotin and streptavidin). Therefore 
depending upon the type of label carried by a polynucleotide or a probe, it may be employed to 
capture or to detect the target DNA. Further, it will be understood that the polynucleotides, primers 
or probes provided herein, may, themselves, serve as the capture label. For example, in the case 
where a sohd phase reagent's binding member is a nucleic acid sequence, it may be selected such that 
it binds a complementary portion of a primer or probe to thereby immobilize the primer or probe to 
the solid phase, hi cases where a polynucleotide probe itself serves as the binding member, those 

*<ji**y^niziiu+ij co uiw ioi 6 u. m axe v,^cic u puAyauvicoaoe primer icseir serves as tne capture 
label, at least a portion of the primer will be free to hybridize with a nucleic acid on a solid phase. 
DNA labeling techniques are well known to the skilled technician. 

The probes of the present invention are useful for a number of purposes. They can be notably 
used in Southern hybridization to genomic DNA The probes can also be used to detect PCR 
amplification products. They may also be used to detect mismatches in CPP-encoding genes or 
mRNA using other techniques. 

Any of the nucleic acids, polynucleotides, primers and probes of fee present invention can be 
conveniently immobilized on a solid support. Solid supports are known to those skilled in the art and 
include the walls of wells of a reaction tray, test tubes, polystyrene beads, magnetic beads, 
nitrocellulose strips, membranes, microparticles such as latex particles, sheep (or other animal) red 
blood cells, duracytes and others. The solid support is not critical and can be selected by one skilled 
in the art. Thus, latex particles, microparticles, magnetic or non-magnetic beads, membranes, plastic 
tubes, walls of nricrotiter wells, glass or silicon chips, sheep (or other suitable animal's) red blood 
cells and duracytes are all suitable examples. Suitable methods for immobilizing nucleic acids on 
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solid phases include ionic, hydrophobic, covalent interactions and the like. A solid support, as used 
herein, refers to any material which is insoluble, or can be made insoluble by a subsequent reaction. 
The solid support can be chosen for its intrinsic ability to attract and immobilize the capture reagent 
Alternatively, the solid phase can retain an additional receptor which has the ability to attract arid 
immobilize the capture reagent The additional receptor can include a charged substance that is 
oppositely charged with respect to the capture reagent itself or to a charged substance conjugated to 



the 



specific 



member attached to the solid support and which has the ability to immobilize the capture reagent 
through a specific binding reaction. The receptor molecule enables the indirect binding of the capture 

10 reagent to a solid support material before the performance of the assay or during the performance of 
the assay. The solid phase thus can be a plastic, derivatized plastic, magnetic or non-magnetic metal, 
glass or silicon surface of a test tube, microtiter well, sheet, bead, microparticle, chip, sheep (or other 
suitable animal's) red blood cells, duracytes and other configurations known to those of ordinary skill 
in the art Hie nucleic acids, polynucleotides, primers and probes of the invention can be attached to 

15 or immobilized on a solid support individually or in groups of at least 2, 5, 8, 10, 12, 15, 20, or 25 
distinct polynucleotides of the invention to a single solid support hi addition, polynucleotides other 
than those of the invention may be attached to the same solid support as one or more polynucleotides 



j\ay poiynuuieouuc piu vioea nereis iaay attacu^u in ove rlapping areas ur at ranuom 
20 locations on a solid support Alternatively the polynucleotides of the invention may be attached in an 
ordered array wherein each polynucleotide is attached to a distinct region of the solid support which 
does not overlap with the attachment site of any other polynucleotide. Preferably, such an ordered 
array of polynucleotides is designed to be "addressable" where the distinct locations are recorded and 
can be accessed as part of an assay procedure. Addressable polynucleotide arrays typically comprise 
25 a plurality of different oligonucleotide probes that are coupled to a surface of a substrate in different 
known locations. The knowledge of the precise location of each polynucleotides location makes these 
"addressable" arrays particularly useful in hybridization assays. Any addressable array technology 
known in the art can be employed with the polynucleotides of the invention. One particular 
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30 described in US Patent 5, 143,854; PCT publications WO 90/1 5070 and 92/10092, the disclosures of 
which are incorporated herein by reference in their entireties. 



36 



WO 2004/106941 



PCT/EP2004/005786 



Methods for obtaining variant nucleic acids and polypeptides 

m addition to naturally-occurring allelic variants of the CPP sequences that may exist in the 
population, the skilled artisan will appreciate that changes can be introduced by mutation into the 
nucleotide sequences coding for CPPs, thereby leading to changes in the amino acid sequence of the 
encoded CPPs, with or without altering the functional ability of the CPPs. 

Several types of variants are contemplated including 1) one in which one or more of the 
amino acid residues are substituted with a conserved or non-conserved amino acid residue and such 
substituted amino acid residue may or may not be one encoded by the genetic code, or 2) one in which 
one or more of the amino acid residues includes a substituent group, or 3) one in which the mutated 
CPP is fused with another compound, such as a compound to increase the half-life of me polypeptide 
(for example, polyethylene glycol), or 4) one in which the additional amino acids are fused to the 
CPP, such as a leader, a signal or anchor sequence, a sequence which is employed for purification of 
the CPP, or sequence from a precursor protein. Such variants are deemed to be within the scope of 
those skilled in the art 

For example, nucleotide substitutions leading to amino acid substitutions can be made in the 
sequences that do not substantially change the biological activity of the protein. An ammo acid 
residue-can be altered from the wild-type sequence encoding a CPP, or a biologically active fragment 

shared among the CPPs of the present invention are predicted to be less amenable to alteration. 

In another aspect, the invention pertains to nucleic acid molecules encoding CPPs that contain 
changes in amino acid residues that result in increased biological activity, or a modified biological 
activity. In another aspect, the invention pertains to nucleic acid molecules encoding CPPs that 
contain changes in amino acid residues that are essential for a CPP biological activity. Such CPPs 
differ in amino acid sequence from SEQ ID NOs: 1-4, 5-12, 13-19 and 20-25 and display reduced 
activity, or essentially lack one or more CPP biological activities. ' 

Mutations, substitutions, additions, or deletions can be introduced into any of SEQ ID NOs: 1- 
4, 5-12, 13-19 and 20-25, by standard techniques, such as site-directed mutagenesis and PCR- 
mediated mutagenesis. For example, conservative amino acid substitutions may be made atone or 
more predicted non-essential amino acid residues. A "conservative amino acid substitution" is one in 
which the amino acid residue is replaced with an amino acid residue having a similar side chain. 
Families of amino acid residues having similar side chains have been defined in the art These 
families include amino acids with basic side chains (e.g., lysine, argmine, histidine), acidic side 
chains (e.g., aspartic acid, glutamic acid), uncharged polar side chains (e.g., glycine, asparagine, 
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glutamine, serine, threonine, tyrosine, cysteine), nonpolar side chains (e.g., alanine, valine, leucine, 
isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched side chains (e.g., 
threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine, phenylalanine, tryptophan, 
histidine). Thus, a predicted nonessential amino acid residue in a CPP, or a biologically active 
fragment or homologue thereof may be replaced with another ammo acid residue from the same side 
chain family. Alternatively, in another embodiment, mutations can be introduced randomly along all 
or part of a CPP coding sequence, such as by saturation mutagenesis, and the resultant mutants can be 
screened for CPP biological activity to identify mutants that retain activity. Following mutagenesis of 
the nucleotide encoding one of SEQ ED NOs: 1-4, 5-12, 13-19 and 20-25, the encoded protein can be 
expressed recombinantry and the activity of the protein can be determined in any suitable assay, for 
example, as provided herein, 

. Hie invention also provides CPP chimeric or fusion proteins* As used herein, a CPP "chimeric 
protein" or "fusion protein" comprises a CPP of the invention or fragment thereof, operatively linked 
or fused in frame to a non-CPP polypeptide sequence, hi a preferred embodiment, a CPP fusion 
protein comprises at least one biologically active portion of a CPP. In another preferred embodiment, 
a CPP fusion protein comprises at least two biologically active portions of a CPP. For example, in 
one embodiment, the fusion protein is a GST-CPP fusion protein in which CPP domain sequences are 

recombinant ci^s. in anotner embodiment, trie fusion protein is a cfr containing a Heterologous 
signal sequence at its N-terminus, for example, to allow for a desired cellular localization in a certain 
host cell. In yet another embodiment, the fusion is a CPP biologically active fragment and an 
immunoglobulin molecule. Such fusion proteins are useful, for example, to increase the valency of 
CPP binding sites. For example, a bivalent CPP binding site may be formed by fusing biologically 
active CPP fragments to an IgG Fc protein. 

CPP fusion proteins of the invention can be used as immunogens to produce anti-CPP 
antibodies in a subject, to purify CPP or CPP ligands, and in screening assays to identify CPP 
modulators. 

Furthermore, isolated fragments of CPPs can also be obtained by screening peptides 
recombinantry produced from the corresponding fragment of the nucleic acid encoding such peptides, 
fn addition, fragments can be chemically synthesized using techniques known in the art such as 
conventional Merrifield solid phase f-Moc or t-Boc chemistry. For example, a CPP of the present 
invention may be arbitrarily divided into fragments of desired length with no overlap of the fragments, 
or preferably divided into overlapping fragments of a desired length. The fragments can be produced 
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(recombinantly or by chemical synthesis) and tested to identify those peptidyl fragments with a CPP 
biological activity, for example, by microinjection assays or in vitro protein binding assays. In an 
illustrative embodiment, peptidyl portions of a CPP, such as a CPP target binding region, can be 
tested for CPP activity by expression as thioredoxin fusion proteins, each of which contains a discrete 
fragment of the CPP (see, for example, ILS. Patents 5, 270,181 and 5,292,646; and PCT publication 
WO94/02502, the disclosures of which are incorporated herein by reference). 

In addition, libraries of fragments of a CPP coding sequence can be used to generate a 
variegated population of CPP fragments for screening and subsequent selection of variants of a CPP. 
m one embodiment, a library of coding sequence fragments can be generated by treating a double 
stranded PCR fragment of CPP coding sequence with a nuclease under conditions wherein nicking 
occurs only about once per molecule, denaturing die double stranded DNA, renaturing the DNA to 
form double stranded DNA which can include sense/antisense pairs from different nicked products, 
removing single stranded portions from reformed duplexes by treatment with SI nuclease, and 
ligating the resulting fragment library into an expression vector. By this method, an expression library 
can be derived which encodes N-terminal, C-ternrinal and internal fragments of various sizes of the 
CPP. 

Whether a change in the amino acid sequence of a peptide results in a functional CPP 



peptide. Peptides in wmcn more man one replacement has ta&en place can readily be tested in the 
20 same manner. 
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Chemical Manufacture of CPP Compositions 

Peptides of the invention are synthesized by standard techniques (e.g. Stewart and Young, 
Solid Phase Peptide Synthesis, 2nd Ed., Pierce Chemical Company, Rockford, IL, 1984). Preferably, 
a commercial peptide synthesizer is used, e.g. Applied Biosy stems, Inc. (Foster City, CA) model 
430A, and polypeptides of the invention may be assembled from multiple, separately synthesized and 
purified, peptide in a convergent synthesis approach, e.g. Kent et al, U.S. patent 6, 1 84,344 and 
Dawson and Kent, Annu. Rev. Biochem., 69: 923-960 (2000). Peptides of the invention may be 
assembled by solid phase synthesis on a cross-linked polystyrene support starting from the carboxyl 
terminal residue and adding amino acids in a stepwise fashion until the entire peptide has been 
formed. Hie following references are guides to the chemistry employed during synthesis: Schnolzer 
et al, Int. J. Peptide Protein Res., 40: 180-193 (1992); Merrifield, J. Amer. Chem. Soc., Vol. 85, pg. 
2149 (1963); Kent et al., pg 185, in Peptides 1984, Ragnarsson, Ed. (Almquist and Weksell, 
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Stockholm, 1984); Kent et al., pg. 217 in Peptide Chemistry 84, Izumiya, Ed (Protein Research 
Foundation, BH. Osaka, 1985); Merrifield, Science, VoL 232, pgs. 341-347 (1986); Kent, Ann. Rev. 
Biochem, VoL 57, pgs. 957-989 (1988), and references cited in these latter two references. 

Preferably, chemical synthesis of polypeptides of the invention is carried out by the assembly 
of peptide fragments by native chemical ligation, as described by Dawson et al, Science, 266: 776-779 
(1994) and Kent el al, U.S. patent 6,184,344. Briefly, in the approach a first peptide fragment is 
provided with an N-terminal cysteine having an unoxidized sulfhydryl side chain, and a second 
peptide fragment is provided with a C-termmal mioester. The unoxidized sulfhydryl side chain of the 
N-terminal cysteine is then condensed with the C-terminal mioester to produce an intermediate 
peptide fragment which links the first and second peptide fragments with a P-aminothioester bond. 
The p-aminothioester bond of the intermediate peptide fragment then undergoes an intramolecular 
rearrangement to produce the peptide fragment product which links the first and second peptide 
fragments with an amide bond. Preferably, the N-tenninal cysteine of the internal fragments is 
protected from undesired cyclization and/or concatenation reactions by a cyclic thiazolidine 
protecting group as described below. Preferably, such cyclic thiazolidine protecting group is a 
thioprolinyl group. 

Peptide fragments having a C-terminal mioester may be produced as described in the 

al, Proc. Natl. Acad. Set., 92: 12485-12489 (1995); Blake, Jnt J. Peptide Protein Res., 17: 273 (1981); 
Canne et al, Tetrahedron Letters, 36: 1217-1220 (1995); Hackeng et al, Proc. Natl. Acad. ScL, 94: 
7845-7850 (1997); or Hackeng et al, Proc. Natl. Acad. ScL, 96: 10068-10073 (1999). Preferably, the 
method described by Hackeng et al (1999) is employed. Briefly, peptide fragments are synthesized on 
a solid phase support (described below) typically on a 0.25 mmol scale by using the in situ 
neutralization/HBTU activation procedure for Boc chemistry disclosed by Schnolzer et al, hit J. 
Peptide Protein Res., 40: 180-193 (1992), which reference is incorporated herein by reference. 
(HBTU is 2<lH-benzotriazol-l-yl)-l,l,3,3-tetran^ and Boc is tert- 

butoxycarbonyl). Each synthetic cycle consists of Nf-Boc removal by a 1- to 2- minute treatment 

* 

with neat TFA, a 1 -minute DMF flow wash, a 10- to 20-minute coupling time with 1.0 mmol of 
preactivated Boc-amino acid in the presence of DIEA, and a second DMF flow wash. (TFA is 
trifluoroacetic acid, DMF is N,N-dimethylfGrmamide, and DIEA is N,N-diisopropyIethylamine). N*- 
Boc-amino acids (1.1 mmol) are preactivated for 3 minutes with 1 .0 mmol of HBTU (0.5 M in DMF) 
in the presence of excess DIEA (3 mmol). After each coupling step, yields are determined by 
measuring residual free amine with a conventional quantitative ninhydrin assay, e.g. as disclosed in 
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Sarin et al, Anal. Biochem., 1 17: 147-157 (1981). After coupling of Gin residues, a DCM flow wash 
is used before and after deprotection by using TFA, to prevent possible high-temperature 
(TFA/DMF)-catalyzed pyrrolidone formation. After chain assembly is completed, the peptide 
fragments are deprotected and cleaved from the resin by treatment with anhydrous HF for 1 hour at 
0°C with 4% /7-cresol as a scavenger. The imidazole side-chain 2,4-dinitrophenyl (dnp) protecting 
groups remain on the His residues because the dnp-removal procedure is incompatible with C- 
terminal thioester groups. However, dnp is gradually removed by thiols during the ligation reaction. 
After cleavage, peptide fragments are precipitated with ice-cold diefhylether, dissolved in aqueous 
acetonitrile, and lyophilized. 

Thioester peptide fragments described above are preferably synthesized on a trityl -associated 
mercaptopropionic acid-leucine (TAMPAL) resin, made as disclosed by Hackeng et al (1999), or 
comparable protocol. Briefly, N^-Boc-Leu (4 mmol) is activated with 3.6 mmol of HBTU in the 
presence of 6 mmol of DBEA and coupled for 16 minutes to 2 mmol of p-methylbenzhydrylarrrine 
(MBHA) resin, or the equivalent Next, 3 mmol of S-trityl mercaptopropionic acid is activated with 
2.7 mmol of HBTU in the presence of 6 mmol of DIEA and coupled for 16 minutes to Leu-MBHA 
resin. Hie resulting TAMPAL resin can be used as a starting resin for polypeptide-chain assembly 
after removal of the trityl protecting group with two 1 -minute treatments with 3.5% triisopropylsilane 

standard in situ-neutral ization peptide coupling protocols for 1 hour, as disclosed in Schnolzer et al 
(cited above). Treatment of the final peptide fragment with anhydrous HF yields the C-terminal 
activated mercaptopropionic acid-leucine (MPAL) thioester peptide fragments. 

Preferably, thiazolidine-protected thioester peptide fragment intermediates are used in native 
chemical ligation under conditions as described by Hackeng et al (1999), or like conditions. Briefly, 
0.1 M phosphate buffer (pH 8.5) containing 6 M guanidine, 4% (vol/vol) benzylmercaptan, and 4% 
(vol/vol) thiophenol is added to dry peptides to be ligaied, to give a final peptide concentration of 1-3 
mM at about pH 7, lowered because of die addition of thiols and TFA from the lyophilized peptide. 
Preferably, the ligation reaction is performed in a heating block at 37°C and is periodically vortexed 
to equilibrate the thiol additives. The reaction may be monitored for degree of completion by 
MALDI-MS or HPLC and electrospray ionization MS. 

After a native chemical ligation reaction is completed or stopped, the N-terminal thiazolidine 
ring of the product is opened by treatment with a cysteine deprotecting agent, such as Q- 
methylhydroxylamine (0.5 M) at pH 33-4.5 for 2 hours at 37° C, after which a 10-fold excess of Tris- 
(2-carboxyethyl)-phosphine is added to the reaction mixture to completely reduce any oxidizing 



41 



WO 2004/106941 PCT7EP2004/005786 

reaction constituents prior to purification of the product by conventional preparative HPLC. 
Preferably, fractions containing the ligation product are identified by electrospray MS, are pooled, 
and lyophilized. 

After the synthesis is completed and the final product purified, the final polypeptide product 
5 may be refolded by conventional techniques, e.g. Creighton, Meth. Enzymol., 107: 305-329 (1984); 
White, Mem. EnzymoL, 11: 481-484 (1967); Wetlaufer, Meth Enzymol., 107: 301-304 (1984); and 
die lite. Preferably, a final product is refolded by air oxidation by the following, or like: The 
reduced lyophilized product is dissolved (at about 0.1 mg/mL) in 1 M guanidine hydrochloride (or 
like chaotropic agent) with 100 mM Tris, 10 mM methionine, at pH 8.6. After gentle overnight 
10 stirring, the re-folded product is isolated by reverse phase HPLC with conventional protocols. 

Recombinant Expression Vectors and Host Cells 

The polynucleotide sequences described herein can be used m recombinant DNA molecules 
that direct the expression of the corresponding polypeptides in appropriate host cells. Because of the 
15 degeneracy in the genetic code, other DNA sequences may encode die equivalent amino acid 

sequence, and may be used to clone and express the CPPs. Codons preferred by a particular host cell 
may be selected and substituted into the naturally occurring nucleotide sequences, to increase the rate 

\J?# may oe inserted into a repncabie vector lor clonmg (ampiiticauon 01 me UNA ), or tor 
20 expression. The polypeptide can be expressed recombinantly in any of a number of expression 
systems according to methods known in the art (Ausubel, et al., editors, Current Protocols in 
Molecular Biology, John Wiley & Sons, New York, 1990). Appropriate host cells include yeast, 
bacteria, archebacteria, fungi, and insect and animal cells, including mammalian cells, for example 
primary cells, including stem cells, including, but not limited to bone marrow stem cells. More 
25 specifically, these include, but are not limited to, microorganisms such as bacteria transformed with 
recombinant bacteriophage, plasmid or cosmid DNA expression vectors, and yeast transformed with 
yeast expression vectors. Also included, are insect cells infected with a recombinant insect virus 
(such as baculovirus), and mammalian expression systems. The nucleic acid sequence to be expressed 
may be inserted into the vector by a variety of procedures. In general, DNA is inserted into an 
30 appropriate restriction endonuclease site using techniques known in the art. Vector components 
generally include, but are not limited to, one or more of a signal sequence, an origin of replication, 
one or more marker genes, an enhancer element, a promoter, and a transcription termination sequence. 
Construction of suitable vectors containing one or more of these components employs standard 
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ligation techniques which are known to die skilled artisan. 

The CPPs of the present invention are produced by culturing a host cell transformed with an 
expression vector containing a nucleic acid encoding a CPP, under the appropriate conditions to 
induce or cause expression of the protein. The conditions appropriate for CPP expression will vary 
5 with the choice of the expression vector and the host cell, as ascertained by one skilled in the art For 
example, the use of constitutive promoters in the expression vector may require routine optimization 
of host cell growth and proliferation, while the use of an inducible promoter requires the appropriate 
growth conditions for induction, In addition, in some embodiments, the timing of the harvest is 
important For example, the baculo viral systems used in insect cell expression are lytic viruses, and 

1 0 thus harvest time selection can be crucial for product yield. 

A host cell strain may be chosen for its ability to modulate the expression of the inserted 
sequences or to process the expressed protein in the desired fashion. Such modifications of the protein 
include, but are not limited to, glycosyl, acetyl, phosphate, amide, lipid, carboxyl, acyl, or 
carbohydrate groups. Post-translational processing, which cleaves a "prepro" form of the protein, may 

15 also be important for correct insertion, folding and/or function. By way of example, host cells such as 
CHO, HeLa, BHK, MDCK, 293, W138,«tc. have specific cellular machinery and characteristic 
mechanisms for such post-translational activities and may be chosen to ensure the correct 

metanogaster ceils, Hacciiaromyces cerevisiae and otner yeasts, is. coil, tsaciaus suoutis, ceiis, 
20 C129 cells, 293 cells, Neurospora, BHK, CHO, COS, and HeLa ceils, fibroblasts, Schwanoma cell 
lines, immortalized mammalian myeloid and lymphoid cell lines, Jurkat cells, human cells and other 
primary cells. 

The nucleic acid encoding a CPP must be "operably linked" by placing it into a functional 
relationship with another nucleic acid sequence. For example, DNA for a presequence or secretory 

25 leader is operably linked to DNA for a polypeptide if it is expressed as a preprolein that participates in 
the secretion of the polypeptide; a promoter or enhancer is operably linked to a coding sequence if it 
affects the transcription of the sequence; or a ribosome binding site is operably linked to a coding 
sequence if it is positioned so as to facilitate translation. Generally, "operably linked" DNA sequences 
are contiguous, and, in the case of a secretory leader or other polypeptide sequence, contiguous and in 

30 reading phase. However, enhancers do not have to be contiguous. Linking is accomplished by ligation 
at convenient restriction sites. If such sites do not exist, the synthetic oligonucleotide adaptors or 
linkers are used in accordance with conventional practice. Promoter sequences encode either 
constitutive or inducible promoters. The promoters may be either naturally occurring promoters or 
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hybrid promoters. Hybrid promoters, which combine elements of more than one promoter, are also 
known in the art, and are useful in the present invention. The expression vector may comprise 
additional elements, for example, the expression vector may have two replication systems, thus 
allowing it to be maintained in two organisms, for example in mammalian or insect cells for 
expression and in a procaryotic host for cloning and amplification. Both expression and cloning 
vectors contain a nucleic acid sequence that enables the vector to replicate in one or more selected 
host cells. Such sequences are well known for a variety of bacteria, yeast, and viruses. The origin of 
replication from the plasmid pBR322 is suitable for most Gram-negative bacteria, the 2: plasmid 
origin is suitable for yeast, and various viral origins (SV40, polyoma, adenovirus, VSV or BPV) are 
useful for cloning vectors in mammalian cells. Further, for integrating expression vectors, the 
expression vector contains at least one sequence homologous to the host cell genome, and preferably, 
two homologous sequences which flank the expression construct. The integrating vector may be 
directed to a specific locus in the host cell by selecting the appropriate homologous sequence for 
inclusion in the vector. Constructs for integrating vectors are well known in the art In an additional 
embodiment, a heterologous expression control element may be operably linked with the endogenous 
gene in the host cell by homologous recombination (described in US Patents 6410266 and 6361972, 
disclosures of which are hereby incorporated by reference in their entireties). This technique allows 



processing and modification Of** enaogenousiy expressed oy me ttost cell, useiui Heterologous 
expression control elements include but are not limited to CMV immediate early promoter, die HS V 
thymidine kinase promoter, the early and late SV40 promoters, the promoters of retroviral LTRs, such 
as those of the Rous Sarcoma Virus (RSV), and metallothionein promoters. 

Preferably, the expression vector contains a selectable marker gene to allow the selection of 
transformed host cells. Selection genes are well known in the art and will vary with the host cell 
used. Expression and cloning vectors will typically contain a selection gene, also termed a selectable 
marker. Typical selection genes encode proteins that (a) confer resistance to antibiotics or other 
toxins, e.g., ampicillin, neomycin, methotrexate, or tetracycline, (b) complement auxotrophic 
deficiencies, or (c) supply critical nutrients not available for from complex media, e.g., the gene 
encoding D-alanine racemase for Bacilli. 

Host cells transformed with a nucleotide sequence encoding a CPP may be cultured under 
conditions suitable for the expression and recovery of the encoded protein from cell culture. The 
protein produced by a recombinant cell may be secreted, membrane-bound, or contained 
intracellularly depending on the sequence and/or the vector used. As will be understood by those of 
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skill in the art, expression vectors containing polynucleotides encoding the CPP can be designed with 
signal sequences which direct secretion of the CPP through a prokaryotic or eukaryotic cell 
membrane. The desired CPP may be produced recombinantly not only directly, but also as a fusion 
polypeptide with a heterologous polypeptide, which may be a signal sequence or other polypeptide 
5 having a specific cleavage site at the N-ternrinus of the mature protein or polypeptide. In general, the 
signal sequence may be a component of the vector, or it may be a part of the CPP-encoding DNA that 
is inserted into the vector. The signal sequence may be a prokaryotic signal sequence selected, for 
example, from the group of the alkaline phosphatase, penicillinase, lpp, or heat-stable enterotoxin II 
leaders. For yeast secretion the signal sequence may be, e.g., the yeast invertase leader, alpha factor 

10 leader (including Sacch aromyces and Kluyveromyces a-factor leaders, the latter described in U.S. Pat. 
No. 5,010,182), or acid phosphatase leader, the C. albicans glucoamylase leader (EP 362,179 
published Apr. 4, 1 990), or the signal described in WO 90 1 13646 published Nov. 1 5, 1990. hi 
mammalian cell expression, mammalian signal sequences may be used to direct secretion of the 
protein, such as signal sequences from secreted polypeptides of the same or related species, as well as 

15 viral secretory leaders. According to the expression system selected, the coding sequence is inserted 
into an appropriate vector, which in ton may require the presence of certain characteristic "control 
elements" or "regulatory sequences." Appropriate constructs are known generally in the art (Ausubel, 

_i 1 c*nr\\ — j ^ „_ „ ,:i~v/» _ -i 1; — „ - t^.,.;.^-,^- 

JJiego, CaJir,>, &&amgenexLa iotta, uam.>, xjr&ctr iskl, QKOCKvme, Mdj or Ciontecn ^aio Aito, 
20 Calif). 

Expression in Bacterial Systems 

Transformation of bacterial cells may be achieved using an inducible promoter such as the 
hybrid lacZ promoter of the "BLUESCRIPT" Phagerrrid (Stratagene) or "pSPORTl" (Gibco BRL). In 

25 addition, a number of expression vectors may be selected for use in bacterial cells to produce 

cleavable fusion proteins that can be easily detected and/or purified, including, but not limited to 
"BLUES CRIPT' (a-galactosidase; Stratagene) or pGEX (glutathione S-transferase; Promega, 
Madison, Wis.). A sui table bacterial promoter is any nucleic acid sequence capable of binding 
bacteria] RNA polymerase and initiating the downstream (3') transcription of the coding sequence of 

30 the CPP gene into mKNA. A bacteria] promoter has a transcription initiation region which is usually 
placed proximal to the 5' end of the coding sequence. This transcription initiation region typically 
includes an RNA polymerase binding site and a transcription initiation site. Sequences encoding 
metabolic pathway enzymes provide particularly useful promoter sequences. Examples include 
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promoter sequences derived from sugar metabolizing enzymes, such as galactose, lactose and maltose, 
and sequences derived from bio synthetic enzymes such as tryptophan. Promoters from bacteriophage 
may also be used and are known in the art. In addition, synthetic promoters and hybrid promoters are 
also useful; for example, the tat promoter is a hybrid of the tip and lac promoter sequences. 
Furthermore, a bacterial promoter can include naturally occurring promoters of non-bacterial origin 
mat have the ability to bind bacterial RNA polymerase and initiate transcription. An efficient 
ribosome-bmding site is also desirable. The expression vector may also include a signal peptide 
sequence that provides for secretion of the CPP in bacteria. Hie signal sequence typically encodes a 
signal peptide comprised of hydrophobic amino acids which direct the secretion of the protein from 
the cell, as is well known in the art. The protein is either secreted into the growth media (gram- 
positive bacteria) or into the periplasmic space, located between the inner and outer membrane of the 
cell (gram-negative bacteria). The bacterial expression vector may also include a selectable marker 
gene to allow for the selection of bacterial strains that have been transformed. Suitable selection genes 
include drug resistance genes such as ampicilrm, chloramphenicol, erythromycin, kanamycin, 
neomycin and tetracycline. Selectable markers also include biosynthetic genes, such as those in the 
histidine, tryptophan and leucine biosynthetic pathways. When large quantities of CPPs are needed, 
e.g,, for the induction of antibodies, vectors which direct high level expression of fusion proteins that 

colt cloning and expression vectors sucix as ui^uissutori {Straiagenej, in wnicti uie erf cooing 
sequence may be ligated into the vector in-frame with sequences for the ammo-terminal Met and the 
subsequent 7 residues of beta-galactosidase so that a hybrid protein is produced; PIN vectors (Van 
Heeke & Schuster JBiol Chem 264:5503-5509 1989)); PET vectors (Novagen, Madison Wis.); and 
the like. Expression vectors for bacteria include the various components set forth above, and are well 
known in the art Examples include vectors for Bacillus subtilis, E. coli, Streptococcus cremoris, and 
Streptococcus Uvidans, among others. Bacterial expression vectors are transformed into bacterial host 
cells using techniques well known in the art, such as calcium chloride mediated transfection, 
electroporation, and others. 

Expression in Yeast 

Yeast expression systems are well known in the art, and include expression vectors for 
Saccharomyces cerevisiae, Candida albicans and C. maltosa, Hansemda polymorpha, Kluyveromyces 
fragilis and K. lactis, Pichia guillermondii and P pastoris, Schizosaccharomyces pombe, and 
Yarrowia lipolytics Examples of suitable promoters for use in yeast hosts include the promoters for 
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3-phosphoglycerate kinase (Hitzeman et aL, J. Biol Chenu 255:2073 (1980)) or other glycolytic 
enzymes (Hess et aL, J. Adv. Enzyme Reg. 7:149 (1968); Holland, Biochemistry 17:4900 (1978)), such 
as enolase, glyceraldehyde-3- phosphate dehydrogenase, hexokinase, pyruvate decarboxylase, 



ise, 



phosphofructokinase, glucose- 6-phosphate isomerase, 3-phosphoglycerate mutase, pyruvate 
triosephosphate isomerase, phosphoghicose isomerase, alpha factor, the ADH2IGAPDH promoter, 
glucokinase alcohol oxidase, and PGH. See, for example, Ausubel, et al., 1990; Grant et al., Methods 
m Enzymology 153:5 16-544, (1987). Other yeast promoters, which are inducible have the additional 
advantage of transcription controlled by growth conditions, include the promoter regions for alcohol 
dehydrogenase 2, isocytochrome C, acid phosphatase, degradative enzymes associated with nitrogen 
metabolism, metallothionein, glyceraldehyde-3-phosphate dehydrogenase, and enzymes responsible 
for maltose and galactose utilization. Suitable vectors and promoters for use in yeast expression are 
further described in EP 73,657. Yeast selectable markers include ADE2. HIS4. LEU2. TRP1. and 
ALG7, which confers resistance to tunicamycin; the neomycin phosphotransferase gene, which 
confers resistance to G418; and the CUP1 gene, which allows yeast to grow in the presence of copper 
ions. Yeast expression vectors can be constructed for intracellular production or secretion of a GPP 



from the DNA encoding the CPP of interest. For example, a selected signal peptide and the 
appropriate constitutive or inducible promoter may be inserted into suitable restriction sites in the 



encoding the CPP can be cloned into the selected plasmid, together with DNA encoding the promoter, 
the yeast alpha-factor secretory signal/leader sequence, and linker sequences (as needed), for 
expression of the GPP. Yeast cells, can then be transformed with the expression plasmids described 
above, and cultured in an appropri ate fermentation media. The protein produced by such transformed 
yeast can then be concentrated by precipitation with 10% trichloroacetic acid and analyzed following 
separation by SDS-PAGE and staining of the gels with Goomassie Blue stain. The recombinant GPP 
can subsequently be isolated and purified from the fermentation medium by techniques known to 
those of skill in the art 



Expression in Mammalian Systems 

The CPP may be expressed in mammalian cells. Mammalian expression systems are known in 
30 the art, and include retroviral vector mediated expression systems. Mammalian host cells may be 
transformed with any of a number of different viral-based expression systems, such as adenovirus, 
where the coding region can be ligated into an adenovirus transcription/translation complex consisting 
of the late promoter and tripartite leader sequence. Insertion in a nonessential £1 or £3 region of the 
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viral genome results in a viable virus capable of expression of the polypeptide of interest in infected 
host cells. A preferred expression vector system is a retroviral vector system such as is generally 
described in PCT/US97/01019 and PCT/US97/ 101048. Suitable mammalian expression vectors 
contain a mammalian promoter which is any DNA sequence capable of binding mammalian RNA 
5 polymerase and initiating fee downstream (3*) transcription of a coding sequence for CPP into 

mRNA. A promoter will have a transcription initiating region, which is usually placed proximal to the 
5' end of the coding sequence, and a TATA box, using a located 25-30 base pairs upstream of the 
transcription initiation site. The TATA box is thought to direct RNA polymerase II to begin RNA 
synthesis at the correct site. A mammalian promoter will also contain an upstream promoter element 

10 (enhancer element), typically located within 100 to 200 base pairs upstream of the TATA box. An 

upstream promoter element determines the rate at which transcription is initiated and can act in either 
orientation. Of particular use as mammalian promoters are the promoters from mammalian viral 
genes, since the viral genes are often highly expressed and have a broad host range. Examples include 
promoters obtained from the genomes of viruses such as polyoma virus, fowlpox virus (UK 2,21 1 , 

1 5 504 published Jul. 5, 1 989), adenovirus (such as Adenovirus 2), bovine papilloma virus, avian sarcoma 
virus, cytomegalovirus, a retrovirus, hepatitis-B virus and Simian Virus 40 (SV40), from heterologous 
mammalian promoters, e.g., the actin promoter or an immunoglobulin promoter, and from heat-shock 

encoding a CPP by higher eukaryotes may be increased by inserting an ennancer sequence into tne 
20 vector. Enhancers are cis-acting elements of DNA, usually about from 10 to 300 bp, that act on a 
promoter to increase its transcription. Many enhancer sequences are now known from mammalian 
genes (globin, elastase, albumin, a-fetoprotein, and insulin). Typically, however, one will use an 
enhancer from a eukaryotic cell virus. Examples include the SV40 enhancer, the cytomegalovirus 
early promoter enhancer, the polyoma enhancer on the late side of the replication origin, and 
25 adenovirus enhancers. The enhancer is preferably located at a site 5' from the promoter. In general, 
the transcription termination and polyadenylation sequences recognized by mammalian cells are 
regulatory regions located 3* to the translation stop codon and thus, together with the promoter 
elements, flank the coding sequence. The 3 ' terminus of the mature mRNA is formed by site-specific 
post-translational cleavage and polyadenylation. Examples of transcription terminator and 
30 polyadenylation signals include those derived from SV40. Long term, high-yield production of 

recombinant proteins can be effected in a stable expression system. Expression vectors which contain 
viral origins of replication or endogenous expression elements and a selectable marker gene may be 
used for this purpose. Appropriate vectors containing selectable markers for use in mammalian cells 



48 



WO 2004/106941 



PCT/EP2004/005786 



are readily available commercially and are known to persons skilled in the art Examples of such 
selectable markers include, but are not limited to herpes simplex virus thymidine kinase and adenine 
phosphoribosyltransferase for use intk-or bprt-cells, respectively. The methods of introducing 
exogenous nucleic acid into mammalian hosts, as well as other hosts, is well known in the art, and 
will vary with the host cell used* Techniques include dextran-mediated transfection, calcium 
phosphate precipitation, polybrene mediated transfection, protoplast fusion, electroporation, viral 
infection, encapsulation of the polynucleotide^) in liposomes, and direct microinjection of the DNA 
into nuclei. 

CPPs can be purified from culture supernatants of mammalian cells transiently transfected or 
stably transformed by an expression vector carrying a CPP-encoding sequence. Preferably, CPP is 
purified from culture supernatants of COS 7 cells transiently transfected by the pcD expression 
vector. Transfection of COS 7 cells with pcD proceeds as follows: One day prior to transfection, 
approximately 10 6 COS 7 monkey cells are seeded onto individual 100 mm plates in Dulbecco's 
modified Eagle medium (DME) containing 10% fetal calf serum and 2 mM glutamine. To perform 
the transfection, the medium is aspirated from each plate and replaced with 4 ml of DME containing 
50 mM Tris.HCl pH 7.4, 400 mg/ml DEAE-Dextran and 50 ng of plasmid DNA. The plates are 
incubated for four hours at 37°C, then the DNA-containing medium is removed, and the plates are 

incubated for an additional 3 hrs at 37°C. The plates are washed once with DME, after which DME 
containing 4% fetal calf serum, 2 mM glutamine, penicillin (100 U/L) and streptomycin (100 fig/L) at 

standard concentrations is added. The cells are then incubated for 72 hrs at 37°C, after which the 
growth medium is collected for purification of CPP. Plasmid DNA for the trans fections is obtained 
by growing pcD(SRa), or like expression vector, containing the CPP-encoding cDNA insert in E. coli 
MC1061 (described by Casadaban and Cohen, J. Mol. Biol., Vol. 138, pgs. 179-207 (1980)), or like 
organism. The plasmid DNA is isolated from the cultures by standard techniques, e.g. Sambrook et 
al., Molecular Cloning: A Laboratory Manual, Second Edition (Cold Spring Harbor Laboratory, New 
York, 1989) or Ausubel et al (1990, cited above). 

Expression in Insect Cells 

CPPs may also be produced in insect cells. Expression vectors for the transformation of insect 
cells, and in particular, baculo virus-based expression vectors, are well known in the art hi one such 
system, the CPP-encoding DNA is fused upstream of an epitope tag contained within a baculovinis 
expression vector. Antographa California* nuclear polyhedrosis virus (AcNPV) is used as a vector to 
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express foreign genes in Spodopterafrugiperda Sf9 cells or in Trichoplusia larvae. Hie CPP-encoding 
sequence is cloned into a nonessential region of the virus, such as the polyhedrin gene, and placed 



under control of the polyhedrin promoter. Successful insertion of a CPP-encoding sequence will 
render the polyhedrin gene inactive and produce recombinant virus lacking coat protein coat The 
recombinant viruses are then used to infect S. frugiperda cells or Trichoplusia larvae in which the 
CPP is expressed (Smith et al., J. Wol. 46:584 (1994); Engelhard EKet al, Proc. Nat Acad, Sri. 
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tags and immunoglobulin tags (like Fc regions of IgG). A variety of plasrrrids may be employed, 
including com m ercially available plasmids such as pVL1393 (Novagen). Briefly, the CPP-encoding 
DNA or the desired portion of the CPP-encoding DNA is amplified by PCR with primers 
complementary to the 5* and 3* regions. Hie 5* primer may incorporate flanking restriction sites. Hie 
PCR product is then digested with the selected restriction enzymes and subcloned into an expression 
vector. Recombinant baculovirus is generated by co-transfecting the above plasmid and 
BaculoGoldTM virus DNA (Pharmingen) into Spodoptera frugiperda ("SfP") cells (ATCC CRL 
171 1) using Irpofectin (commercially available from GIBCO-BRL), or other methods known to those 
of skill in the art Virus is produced by day 4-5 of culture in Sf9 cells at 28°C, and used for further 
amplifications. Procedures are performed as further described in O'Reilley et al., BACULOVIRUS 



be prepared tromie co ii ibi i iant v irus-inleetea SI9 cells as described in Rupert et al., Mature SbZ: 175- 
179 (1993). Alternatively, expressed epitope-tagged CPP can be purified by affinity chromatography, 
or for example, purification of an IgG tagged (or Fc tagged) CPP can be performed using 
chromatography techniques, including Protein A or protein G column chromatography. 



Evaluation of Gene Expression 

25 Gene expression may be evaluated in a sample directly, for example, by standard techniques 

known to those of skill in the art, e.g., Northern blotting to determine the transcription of mRNA, dot 
blotting (DNA or RNA), or in situ hybridization, using an appropriately labeled probe, based on the 
sequences provided herein. Alternatively, antibodies may be used in assays for detection of 
polypeptides, nucleic acids, such as specific duplexes, including DNA duplexes, RNA duplexes, and 

30 DNA-RNA hybrid duplexes or DNA-protein duplexes. Such antibodies may be labeled and the assay 
carried out where the duplex is bound to a surface, so that upon the formation of duplex on the 
surface, the presence of antibody bound to the duplex can be detected. Gene expression, alternatively, 
may be measured by immunohistochemical staining of cells or tissue sections and assay of cell culture 
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or body fluids, to directly evaluate the expression of a CPP polypeptide ox polynucleotide. Antibodies 
useful for such immunological assays may be either monoclonal or polyclonal, and may be prepared 
against a native sequence CPP. Protein levels may also be detected by mass spectrometry. A further 
method of protein detection is with protein chips. 
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Purification of Expressed Protein 

Expressed CPP may be purified or isolated after expression, using any of a variety of methods 
known to those skilled in the art The appropriate technique will vary depending upon what other 
components are present in the sample. Contaminant components mat are removed by isolation or 
purification are materials that would typically interfere with diagnostic or therapeutic uses for the 
polypeptide, and may include enzymes, hormones, and other solutes. The purification step(s) selected 
will depend, for example, on the nature of the production process used and the particular CPP 
produced. As CPPs are secreted, they may be recovered from culture medium. Alternatively, the 
CPP may be recovered from host cell lysates. If membrane-bound, it can be released from the 
membrane using a suitable detergent solution (e.g. Triton-X 100) or by enzymatic cleavage. 
Alternatively, cells employed in expression of CPP can be disrupted by various physical or chemical 
means, such as freeze-thaw cycling, sonication, mechanical disruption, or by use of cell lysing agents. 



chromatography; chromatography usmgsilica gel or a cation-exchange resin such as DHAh; gel 
filtration using, for example, Sephadex G-75; protein A Seph arose columns to remove contaminants 
such as IgG; chromatography using metal chelating columns to bind epi tope-tagged forms of the CPP; 
ethanol precipitation; reverse phase HPLC; chromatofocusing; SDS-PAGE; and ammonium sulfate 
precipitation. Ordinarily, an isolated CPP will be prepared by at least one purification step. For 
example, the CPP may be purified using a standard anti-CPP antibody column. Ultrafiltration and 
dialysis techniques, in conjunction with protein concentration, are also useful (see, for example, 
Scopes, R., PROTEIN PURIFICATION, Springer-Verlag, New York, N.Y., 1982). The degree of 
purification necessary will vary depending on the use of the CPP. In some instances no purification 
will be necessary. Once expressed and purified as needed, the CPPs and nucleic acids of the present 
invention are useful in a number of applications, as detailed herein. 

Transgenic animals 

The host cells of the invention can also be used to produce nonhuman transgenic animals. For 
example, in one embodiment, a host cell of the invention is a fertilized oocyte or an embryonic stem 
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cell into which CPP-coding sequences have been introduced. Such host cells can then be used to 
create non-human transgenic animals in which exogenous CPP sequences have been introduced into 
their genome or homologous recombinant animals in which endogenous CPP sequences have been 
altered. Such animals are useful for studying the function and/or activity of a CPP or fragment thereof 
5 and for identifying and/or evaluating modulators of CPP biological activity. As used herein, a 

"transgenic animal" is a non-human animal, preferably a mammal, more preferably a rodent such as a 
rat or mouse, in which one or more of the cells of the animal include a transgene. Other examples of 
transgenic animals include non-human primates, sheep, dogs, cows, goats, chickens, amphibians, etc. 
A transgene is exogenous DNA which is integrated into the genome of a cell from which a transgenic 

10 animal develops and which remains in the genome of the mature animal, thereby directing the 

expression of an encoded gene product in one or more cell types or tissues of the transgenic animal. 
As used herein, a "homologous recombinant animar is a non-human animal, preferably a mammal, 
more preferably a mouse, in which an endogenous gene has been altered by homologous 
recombination between the endogenous gene and an exogenous DNA molecule introduced into a cell 

15 of the animal, e.g., an embryonic cell of the animal, prior to development of die animal. 

A transgenic animal of the invention can be created by introducing a CPP-encoding nucleic 
acid into the male pronuclei of a fertilized oocyte, e.g., by microinjection or retroviral infection, and 

a fragmen t thereof can be introduced as a transgene into the genome of a non-human animal. 

20 Alternatively, a nonhuman homologue of a human CPP-encoding gene, such as from mouse or rat, can 
be used as a transgene. Intronic sequences and polyadenylation signals can also be included in the 
transgene to increase the efficiency of expression of die transgene. A tissue-specific regulatory 
sequence(s) can be operably linked to a CPP transgene to direct expression of a CPP to particular 
cells. Methods for generating transgenic animals via embryo manipulation and microinjection, 

25 particularly animals such as mice, have become conventional in the art and are described, for 

example, in U.S. Pat Nos. 4,736,866 and 4,870,009, both by Leder et at, U.S. Pat No. 4,873,191 by 
Wagner et al. and in Hogan, B., Manipulating die Mouse Embryo, (Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, N.Y*, 1986, the disclosure of which is incorporated herein by reference in 
its entirety). Similar methods are used for production of other transgenic animals. A transgenic 

30 founder animal can be identified based upon the presence of a CPP transgene in its genome and/or 
expression of CPP mRNA in tissues or cells of the animals. A transgenic founder animal can then be 
used to breed additional animals carrying the transgene. Moreover, transgenic animals carrying a 
transgene encoding a CPP can further be bred to other transgenic animals carrying other transgenes. 
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To create an animal in which a desired nucleic acid has been introduced into the genome via 
riogous recombination, a vector is prepared which contains at least a portion of a CPP-encoding 
sequence into which a deletion, addition or substitution has been introduced to thereby alter, e.g., 
functionally disrupt, the CPP-encoding sequence. The CPP-encoding sequence can be a human gene, 
but more preferably, is a non-human homologue of a human CPP-encoding sequence (e.g., a cDNA 
isolated by stringent hybridization with a nucleotide sequence coding for a CPP). For example, a 

■ 

mouse CPP-encoding sequence can be used to construct a homologous recombination vector suitable 
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designed such that, upon homologous recombination, the endogenous CPP-encoding sequence is 
functionally disrupted (i.e., no longer encodes a functional protein; also referred to as a "knock out 11 
vector). Alternatively, the vector can be designed such that, upon homologous recombination, the 
endogenous CPP-encoding sequence is mutated or otherwise altered but still encodes functional 
protein (e.g., the upstream regulatory region can be altered to thereby alter the expression of the 
endogenous CPP-encoding sequence). In the homologous recombination vector, the altered portion of 
the CPP-encoding sequence is flanked at its 5' and 3 1 ends by additional nucleic acid sequence of the 
CPP gene to allow for homologous recombination to occur between the exogenous sequence carried 
by the vector and an endogenous gene in an embryonic stem cell. The additional flanking nucleic acid 



Typically, several fetfobases ot tlanfcmg DNA (both at the 5* and T ends) are included in the vector 
(see e.g., Thomas, K. R. and Capecchi, M. R. (1987) Cell 51:503, the disclosure of which is 
incorporated herein by reference in its entirety, for a description of homologous recombination 
vectors). The vector is introduced into an embryonic stem cell line (e.g., by electroporation) and cells 
in which the introduced CPP-encoding sequence has homologously recombined with the endogenous 
gene are selected (see e.g., Li, E. et al. (1992) Cell 69:915, the disclosure of which is incorporated 
herein by reference in its entirety). The selected cells are then injected into a blastocyst of an animal 
(e.g., a mouse) to form aggregation chimeras (see e.g., Bradley, A. in Teratocarcinomas and 
Embryonic Stem Cells. A Practical Approach, E. J. Robertson, ed. (IRL, Oxford, 1987) pp. 1 13-152, 
the disclosure of which is incorporated herein by reference in its entirety). A chimeric embryo can 
then be implanted into a suitable pseudopregnant female foster animal and the embryo brought to 
term. Progeny harboring the homologously recombined DNA in their germ cells can be used to breed 
animals m which all cells of the animal contain the homologously recombined DNA by germline 
tr an s mi ssion of the transgene. Methods for constructing homologous recombination vectors and 
homologous recombinant animals are described further in Bradley, A. (1991) Current Opinion in 
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Biotechnology 2:823-829 and in PCT International Publication Nos.: WO 90/1 1354 by Le Mouellec 
et aL; WO 91/01 140 by Smithies et al.; WO 92/0968 by Zijlstra et al.; and WO 93/04169 by Berns et 
al., the disclosures of which are incorporated herein by reference in their entireties. 

hi another embodiment, transgenic non-human animals can be produced which contain 
selected systems which allow for regulated expression of the transgene. One example of such a 
system is the cre/loxP recombinase system of bacteriophage PI . For a description of the cre/loxP 
recombinase system, see, e.g., Lakso et al. (1992) PNAS 89:6232-6236, die disclosure of which is 
incorporated herein by reference in its entirety. Another example of a recombinase system is the FLP 
recombinase system of Saccharomyces cerevisiae (O'Gonnan et al. (1991) Science 251:1351-1355, 
the disclosure of which is incorporated herein by reference in its entirety). If a cre/loxP recombinase 
system is used to regulate expression of the transgene, animals containing transgenes encoding both 
die Cre recombinase and a selected protein are required. Such animals can be provided through the 
construction of "double* 1 transgenic animals, e.g., by mating two transgenic animals, one containing a 
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Assessing CPP activity 

It will be appreciated that the invention further provides methods of testing the activity of or 
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providing a variant orrnodrrietr AJ4*t*-encoamg nucxeic acia and assessing wnetner toe encoaed 
polypeptide displays a CPP biological activity. Encompassed is thus a method of assessing the 
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thereof; and (b) testing said CPP, or a biologically active fragment or homologue thereof for a CPP 
biological activity under conditions suitable for CPP activity. Cell free, cell-based and in vivo assays 
may be used to test CPP activity. For example, said assay may comprise expressing a CPP nucleic 
acid in a host cell, and observing CPP activity in said cell and other affected cells. In another 
example, a CPP, or a biologically active fragment or homologue thereof is contacted with a cell, and a 
CPP biological activity is observed. 

CPP biological activities include: (1) indicating that an individual has or will have a 
cardiovascular disorder, (2) circulating through the bloodstream of individuals with a cardiovascular 
disorder; (3) antigenicity, or the ability to bind an anti-CPP specific antibody; (4) immunogenicity, or 
the ability to generate an anti-CPP specific antibody; and for CPP 12: (5) forming intermolecular 
amino acid side chain interactions such as hydrogen, amide, or preferably disulfide links; (6) 
interaction with a CPP target molecule, preferably a bacterial endotoxin; (7) neutralizing bacterial 
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endotoxins; (8) promoting mast cell chemotaxis; (9) undergoing posttranslational processing, for 
example, specific proteolysis; and (10) functioning as an antimicrobial defense in the gut and female 
reproductive tract; for CPP 15; (5) forming intermolecular amino acid side chain interactions such as 
hydrogen, amide, or preferably disulfide links; (6) interaction with a CPP target molecule, preferably 
complement protein C9; (7) inhibiting complement lysis; and (8) undergoing posttranslational 
processing, for example, specific proteolysis and glycosylation; for CPP 16: (5) interaction with a 
CPP target molecule, preferably NAD/NADP; and for CPP 18: (5) having a posttranslational 
modification, such as glycosylation; and (6) interaction with a CPP target molecule, preferably an 
integrin, microfibril or fibrillin. 

CPP biological activity can be assayed by any suitable method known in the art. Antigenicity 
and immunogenicity may be detected, for example, as described in the sections titled "Anti CPP 
antibodies" and "Uses of CPP antibodies". Circulation in blood plasma may be detected as described 
in 'Diagnostic and Prognostic Uses". Determining the ability of the CPP to bind to or interact with a 
CPP target molecule can be accomplished by a method for directly or indirectly determining binding, 
as is common to the art Such methods are further described in the section titled te Drug Screening 
Assays." 

Cardiovascular disorders may be diagnosed by any method determined appropriate for an 
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the Background section, and are best determined appropriately by one ot skill in the art based on the 
particular profile of a patient 

Intramolecular interactions may be detected by sequence-based structural predictions. Such 
predictions are generally based cm X-ray crystallography or NMR structural data for a polypeptide 
with similar sequence. Detection of intramolecular interactions may also be accomplished using 
SDS-P AGE. For the example of disulfide bonds, links formed between different portions of a given 



bonds may be disrupted by a reducing agent, for example, dithiothreitol (DTT). A protein sample that 



has been treated with a reducing agent may thus be compared to an untreated control by SDS-PAGE 
to detect a change in apparent molecular weight. Such methods are common to the art 

Specific proteolysis may be detected by comparing the molecular weight of a sample peptide 
to that of a peptide of known molecular weight Molecular weights are easily compared according to 



any method common to the art such as SDS-PAGE, gel chromatography, or mass spectrometry. 
Preferably, the molecular weight of a test peptide is obtained by mass spectrometry and compared to a 
database comprising molecular weights of peptides with posttranslational modifications. Exemplary 
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databases include Genpept, SWISSPROT, EMBL, and the Protein Sequence Database. Such 
techniques are detailed further herein. 

For CPP 12, antimicrobial activity may be detected according to any method known in the art, 
5 such as those described by Porter et al. (InfectJmmuiL 65: 2396-2401 (1997)). Briefly, L. 

monocytogenes EGD, E. coli ML35p, & typhimurium 1402BS and 7953S, and C. albicans 820 are 
used. The bacterial cultures are harvested in midgrowth phase, washed, and resuspended at a working 
dilution of 10 6 bacteria/ml in 10 mM sodium phosphate (pH7.4), 1% Trypticase Soy Broth. C 
albicans is harvested in the same manner but isolated in stationary phase. Colony Forming Unit 
10 (CFU) assays and Radial diffusion assays are performed as described in the presence of a test 

compound, a positive control compound, or no compound. A reduction in the value obtained for either 
the CFU or radial diffusion assays in the presence of a test compound as compared to the control 
without compound indicates that the compound has antimicrobial activity. 

For CPP 12, the presence of a compound or peptide in the gut or reproductive tract is 
1 5 detectable by common methods (e.g., immunofluorescent microscopy or ELISA on biological fluid 

samples, as further described herein) or as described by Jones and Bevins (JBC 267:23216-25 (1992)) 
or Svinorich, et al. (Am J Obstet Gynecol (1997) 176:470-5). 

i*neiry, iympRooyte mi^aoon wasassessea using a <*<5 wen nnurociicniuiaiuc* vcauuott 
20 (Neuro Probe Inc. Cabin John, McL). 25 ul of the sample to be tested diluted in chemotaxis medium 
was placed in the lower compartment and 50 ul of cell suspension (at 5 x 1 0 6 cells/ ml) in the upper 
compartment. The two compartments were separated by a polycarbonate filter (for T cells, 5 um pore 
size, larger for other lymphocytes) coated with 10 ug/ml collagen type IV overnight at 4C. The 
apparatus was incubated at 37C for 3 h in humidified air with 5% CO2. At the end of the incubation 
25 period, the filter was removed, fixed and stained with LeukoStat (Fisher Scientific, Pittsburgh, Pa.). 
The number of cells that migrate through the filter were counted by light microscopy per high-power 
fields. The results are expressed as the mean (±SD) value of the migration in triplicate samples and 
are representative of at least three experiments. The statistical significance of the number of cells 
migrating in response to stimuli versus control medium was calculated using the Student's T test For 
30 the in vivo assay, BALB/C and CB-1 7 scid/scid (SCID) mice were obtained from the Animal 
Production Area (NCI-FCRDC, Frederick, M&). Mice were used at 8-12 wk of age and kept in 
pathogen free conditions. SCID mice were treated with anti-ASGM-1 and injected with I * 10 8 huPBL 
Lp. Immediately thereafter 0.1 ml PBS containing purified preparations of 1.0 ug compound or control 
PBS were injected daily subcutaneously into the same injection site. The injection site was examined 
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histologically either at 4 hr after the first or at 24 hr by 4 hr after a second injection. Experiments 
were performed on three to four mice per group in duplicate. 

For CPP 15, to assay for complement lysis, methods such as those described in WO 9302183 
5 (relevant disclosure of which is incorporated herein) may be used. Briefly, the cDNA encoding the 
protein of interest is stably incorporated into the genome of a porcine aortic endothelial cell (PAEC) 
and expressed on the cell surface. Cultures of recombinant PAEC are cultured in DMEM containing 
10% fetal bovine serum (FBS), 5 mM Hepes, 2 mM L-ghrtamine, and 1 % each of penicillin and 
streptomycin (P/S) at 37C in 8 % C02. After transfection or transduction, cell surface expression of 
10 the protein of interest (e.g., CPP) may be detected by flow cytometric techniques. The recombinant 
PAEC are assayed for their sensitivity to cytolysis by complement in human serum. To do this, 
recombinant PAEC and control PAEC are plated into 48-well tissue culture plates at a density of 1 .25 
x 10 6 cells/well in DMEM with 10% FBS, 2 mM glutamine and P/S. The culture media is removed 
and the cells washed three times with media without FBS. Next, human serum diluted in DMEM at 
15 various concentrations is added to the cultures for 2 hours at 37C. The percentage of viable cells 
remaining in each culture is assessed by staining with 0. 1 % trypan blue. 

For CPP 15, the presence of a GPI-anchor may be detected as described in WO 9936778, 

the GFl-anchor moiety are^xpfortetf as detection agents. *YeierabIy, sucii toxins are cietectably 
20 labelled and the assay is carried out on a plasma sample. 

For CPP 18, N-linked glycosylation may be determined, for example, as described in EP 
Patent 0693559B1. Other common methods include comparison of observed and expected molecular 
weights for an arginme-containing peptide sequence, wherein a higher than expected molecular 
25 weight indicates that the peptide is glycosylated. Molecular weight may be detected as described in 
detail herein. 

AnH-CPP Antibodies 

The present invention provides antibodies and binding compositions specific for CPPs. Such 
30 antibodies and binding compositions include polyclonal antibodies, monoclonal antibodies, Fab and 
single chain Fv fragments thereof, bispecific antibodies, heteroconjugates, and humanized antibodies. 
Such antibodies and binding compositions may be produced in a variety of ways, including hybridoma 
cultures, recombinant expression in bacteria or mammalian cell cultures, and recombinant expression 
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in transgenic animals. There is abundant guidance in the literature for selecting a particular 
production methodology, e.g. Chadd and Chamow, Out. Opin. Biotechnol., 12: 188-194 (2001). 

The choice of manufacturing methodology depends on several factors including the antibody 
structure desired, the importance of carbohydrate moieties on the antibodies, ease of culturing and 
purification, and cost Many different antibody structures may be generated using standard 
expression technology, including full-length antibodies, antibody fragments, such as Fab and Fv 
fragments, as well as chimeric antibodies comprising components from different species. Antibody 
fragments of small size, such as Fab and Fv fragments, having no effector functions and limited 
pharmoktnetic activity may be generated in a bacterial expression system. Single chain Fv fragments 
are highly selective for in vivo tumors, show good tumor penetration and low immunogenichy, and 
are cleared rapidly from the blood, «.g. Freyre et al, J. Biotechnol., 76: 157-163 (2000). Thus, such 
molecules are desirable for radioimmunodetectioiL 



Polyclonal Antibodies 

The anti-CPP antibodies of the present invention may be polyclonal antibodies. Such 
polyclonal antibodies can be produced in a mammal, for example, following one or more injections of 
an immunizing agent, and preferably, an adjuvant. Typically, the immunizing agent and/or adjuvant 
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immunizing agent may include AJfiFs tjr a rusion protein tnereot. it may oe useiui to conjugate tne 
antigen to a protein known to be immunogenic in the mammal being immunized. Examples of such 
immunogenic proteins include, but are not limited to, keyhole limpet hemocyanin (KLH), methylated 



bovine serum albu 







IMI 





HUM 



(BS A), Hepatitis B surface antigen, serum 
albumin, bovine thyroglobulin, and soybean trypsin inhibitor. Adjuvants include, for example, 
Freund's complete adjuvant and MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic trehalose 



dicoryno-mycolate). Hie immunization protocol may be determined by one skilled in the art based on 
standard protocols or by routine experimentation. 

Alternatively, a crude protein preparation which has been enriched for a CPP or a portion 
thereof can be used to generate antibodies. Such proteins, fragments or preparations are introduced 
into the non-human mammal in the presence of an appropriate adjuvant. If the serum contains 
polyclonal antibodies to undesired epitopes, die polyclonal antibodies are purified by immunoaffinity 
chromatography. 

Effective polyclonal antibody production is affected by many factors related both to the 
antigen and the host species. Also, host animals vary in response to site of inoculations and dose, with 
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both inadequate and excessive doses of antigen resulting in low titer antisera. Small doses (ng level) 
of antigen administered at multiple intradermal sites appear to be most reliable. Techniques for 
producing and processing polyclonal antisera are known in the art, see for example, Mayer and 
Walker (1987), the disclosure of which is incorporated herein by reference in its entirety. An effective 
5 immunization protocol for rabbits can be found in Vaitukaitis, J. et al. J. Clin. Endocrinol. Metab. 
33:988-991(1971), the disclosure of which is incorporated by reference in its entirety. Booster 
injections can be given at regular intervals, and antiserum harvested when antibody titer thereof, as 
determined semi-quantitatively, for example, by double immxmodiffusion in agar against known 
concentrations of the antigen, begins to fall. See, for example, Ouchteriony, O. et al., Chap. 19 in: 
10 Handbook of Experimental Immunology D. Wier (ed) Blackwell (1973). Plateau concentration of 

antibody is usually m the range of 0.1 to 0.2 mg/ml of serum. Affinity of the antisera for the antigen is 
determined by preparing competitive binding curves, as described, for example, by Fisher, D., Chap. 
42 in: Manual of Clinical Immunology, 2d Ed. (Rose and Friedman, Eds.) Amer. Soc. For Microbiol., 
Washington, D. C. (1 980). 

15 

Monoclonal Antibodies 

Alternatively, the anti-CPP antibodies may be monoclonal antibodies. Monoclonal antibodies 



immunized with an immunizing ag^it to elicit iympnocytes mat produce or are capaoieoi producing 
20 antibodies mat will specifically bind to the immunizing agent, e.g. Kohler and Milstein, Nature 

256:495 (1975). The immunizing agent will typically include the CPP or a fusion protein thereof and 
optionally a carrier. Alternatively, me lymphocytes may be immunized in vitro. Generally, spleen 
cells or lymph node cells are used if non-human mammalian sources are desired, of peripheral blood 
lymphocytes C'PBLs") are used if cells of human origin are desired. The lymphocytes are fused with 
25 an immortalized cell line using a suitable fusing agent, such as polyethylene glycol, to produce a 
hybridoma cell, e.g. Goding, MONOCLONAL AMWODIES: PRINCIPLES AND PRACTICE, 
Academic Press, pp. 59-103 (1986); Liddell and Cryer, A Practical Guide to Monoclonal Antibodies 
(John Wiley & Sons, New York, 1991); Malik and Lillenoj, Editors, Antibody Techniques (Academic 
Press, New York, 1994). In general, immortalized cell lines are transformed mammalian cells, for 
30 example, myeloma cells of rat, mouse, bovine or human origin. The hybridoma cells are cultured in a 
suitable culture medium mat preferably contains one or more substances that inhibit the growth or 
survival of unfused, immortalized cells. For example, if the parental cells lack the enzyme 
hypoxanthine guanine phosphoribosyl transferase (HGPRT), the culture medium for the hybridomas 
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typically will include hypoxanthine, aminopterin, and thymidine (HAT), substances which prevent the 
growth of HGFRT-deflcient cells. Preferred immortalized cell lines are those that fuse efficiently, 
support stable high level production of antibody, and are sensitive to a medium such as HAT medium. 
More preferred immortalized cell lines are murine or human myeloma lines, which can be obtained, 
for example, from the American Type Culture Collection (ATCC), Rockville, MD. Human myeloma 
and mouse-human heteromyeloma cell lines also have been described for the production of human 
monoclonal antibodies, e.g. Kozbor, J. Immunol. 133:3001 (1984); Brodeur et al., Monoclonal 
Antibody Production Techniques and Applications, Marcel Dekker, Inc., New York, pp. 51-63 (1987). 

The culture medium (supernatant) in which the hybrid oma cells are cultured can be assayed 
for the presence of monoclonal antibodies directed against a CPP. Preferably, the binding specificity 
of monoclonal antibodies present in the hybridoma supernatant is determined by immunoprecipitation 
or by an in vitro binding assay, such as radio- immunoassay (RIA) or Enzyme-Linked Immuno 
Sorbent Assay (ELISA). Appropriate techniques and assays are known in the art The binding affinity 
of the monoclonal antibody can, for example, be determined by the Scatchard analysis of Munson and 
Pollard, Anal Biochem. 107:220 (1980). After the desired antibody-producing hybridoma cells are 
identified, the cells may be cloned by limiting dilution procedures and grown by standard methods 
(Goding, 1986, supra). Suitable culture media for this purpose include, for example, Dulbecco's 



in vivo as ascites ma mamumx. inz mououtijudi tuntouaies secrexea oy sciecicu ciones may oc 
isolated or purified from the culture medium or ascites fluid by immunoglobulin purification 
procedures routinely used by those of skill in the art such as, for example, protein A-Sepharose, 
hydroxyi-apatite chromatography, gel electrophoresis, dialysis, or affinity chromatography. 

The monoclonal antibodies may also be made by recombinant DNA methods, such as those 
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be isolated from the CPP-specific hybridoma cells and sequenced, e.g., by using oligonucleotide 
probes that are capable of binding specifically to genes encoding the heavy and light chains of murine 
antibodies. Once isolated, the DNA may be inserted into an expression vector, which is then 
transfected into host cells such as simian COS cells, Chinese hamster ovary (CHO) cells, or myeloma 
cells that do not otherwise produce immunoglobulin protein, to obtain the synthesis of monoclonal 
antibodies in the recombinant host cells. The DNA also may be modified, for example, by substituting 
the coding sequence for the murine heavy and light chain constant domains for the homologous 
human sequences (Morrison et aL, Proc. Nat. Acad, Set 81 :685 1-6855 (1984); Neuberger et al., 
Nature 312:604-608 (1984); Takeda et al., Nature 314:452-454 (1985)), or by covalently joining to 
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the immunoglobulin coding sequence all or part of the coding sequence for a non-immunoglobulin 
polypeptide. Hie non-immunoglobulin polypeptide can be substituted for the constant domains of an 
antibody of the invention, or can be substituted for the variable domains of one antigen-combining 
site of an antibody of the invention to create a chimeric bivalent antibody. Hie antibodies may also be 
monovalent antibodies. Methods for preparing monovalent antibodies are well known in the art For 
example, in vitro methods are suitable for preparing monovalent antibodies. Digestion of antibodies to 
produce fragments thereof, particularly, Fab fragments, can be accomplished using routine techniques 
known in die art. 



Anti 



ies and antibody fragments characteristic of hybridomas of the invention can also be 
produced by recombinant means by extracting messenger RNA, constructing a cDNA library, and 
selecting clones which encode segments of the antibody molecule. The following are exemplary 
references disclosing recombinant techniques for producing antibodies: Wall et al., Nucleic Acids 
Research, VoL 5, pgs. 3113-3128 (1978); Zakut et al., Nucleic Acids Research, Vol. 8, pgs. 3591- 
3601 (1980); Cabilly et al., Proc. Nad. Acad. Sci., VoL 81, pgs. 3273-3277 (1984); Boss et aL, 
Nucleic Acids Research, VoL 12, pgs. 3791-3806 (1984); Amster et al., Nucleic Acids Research, Vol. 
8, pgs. 2055-2065 (1980); Moore et aL, U.S. Patent 4,642,334; Skerra et al, Science, VoL 240, pgs. 
1038-1041(1988); Huse et al, Science, VoL 246, pgs. 1275-1281 (1989); and U.S. patents 6,054,297; 

particular, suca techniques can be used to produce interspecuic monoclonal antiooOies, wfterein tne 
binding region of one species is combined with non-binding region -of the antibody of another species 
to reduce inrmunogenicity, e.g. Liu et al., Proc. Natl. Acad. Sci., Vol. 84, pgs. 3439-3443 (1987), and 
patents 6,054,297 and 5,530,101 . Preferably, recombinant^ produced Fab and Fv fragments are 
expressed in bacterial host systems. Preferably, full-length antibodies are produced by mammalian 
cell culture techniques. More preferably, full-length antibodies are expressed in Chinese Hamster 
Ovary (CHO) cells or NSO cells. 

Both polyclonal and monoclonal antibodies can be screened by ELISA. As in other solid 
phase immunoassays, the test is based on the tendency of macromolecules to adsorb nonspecifically to 
plastic. The irreversibility of this reaction, without loss of immunological activity, allows the 
formation of antigen-antibody complexes with a simple separation of such complexes from unbound 
material. To titrate anti-pep tide serum, peptide conjugated to a carrier different from that used in 
immunization is adsorbed to the wells of a 96-well microtiter plate. Hie adsorbed antigen is then 
allowed to react in the wells with dilutions of anti-pep tide serum. Unbound antibody is washed away, 
and the remaining antigen-antibody complexes are allowed to react with an antibody specific for the 
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IgG of the immunized animal. This second antibody is conjugated to an enzyme such as alkaline 
phosphatase. A visible colored reaction produced when the enzyme substrate is added indicates 
which wells have bound antipeptide antibodies. The use of spectrophotometer readings allows better 
quantification of the amount of pep tide-specific antibody bound. High-titer antisera yield a linear 

5 titration curve between 10* 3 and 10~ 5 dilutions. 
CPP peptide carriers 

The invention includes immunogens derived from CPPs, and immunogens comprising 
conjugates between carriers and peptides of the invention. The term immunogen as used herein refers 

10 to a substance which is capable of causing an immune response. The term carrier as used herein 

refers to any substance which when chemically conjugated to a peptide of the invention permits a host 
organism immunized with the resulting conjugate to generate antibodies specific for the conjugated 
peptide. Carriers include red blood cells, bacteriophages, proteins, or synthetic particles such as 
agarose beads. Preferably, carriers are proteins, such as serum albumin, gamma-globulin, keyhole 

1 5 limpet hemocyanin (KLH), thyreoglobulin, ovalbumin, or fibrinogen. 

The general technique of linking synthetic peptides to a carrier is described in several 
references, e.g. Walter and Doolittle, "Antibodies Against Synthetic Peptides," in Setlow et al., eds., 

477-487 (1932); Leroer et aL, Proc. Natl. Acad. Sci., Vol. 78, pgs. 3403-3407 (1981); Shimizu et al., 
20 U.S. Patent 4,474,754; and Ganfield et aL, U.S. Patent 4,3 11,639. Accordingly, these references are 
incorporated by reference. Also, techniques employed to link haptens to carriers are essentially the 
same as the above-referenced techniques, e.g. chapter 20 in Tijssen, Practice and Theory of Enzyme 
Immunoassays (Elsevier, New York, 1985). The four most commonly used schemes for attaching a 
peptide to a carrier are (1) glutaraldehyde for amino coupling, e.g. as disclosed by Kagan and Glick, 
25 in Jaffe and Behrman, eds. Methods of Hormone Radioimmunoassay, pgs. 328-329 (Academic Press, 
N.Y., 1979), and Walter et al. Proc. Natl. Acad. Sci., Vol. 77, pgs. 5197-5200 (1980); (2) water- 
soluble carbodiimides for carboxyl to amino coupling, e.g. as disclosed by Hoare et al., J. Biol. 
Chem., Vol. 242, pgs. 2447-2453 <1967); (3) bis-diazobenzidine (BDB) for tyrosine to tyrosine 
sidechain coupling, e.g. as disclosed by Bassiri et al., pgs. 46-47, in Jaffe and Behrman, eds. (cited 
30 above), and Walter et al. (cited above); and (4) maleimidobenzoyl-N-hydroxysuccinimide ester 

(MBS) for coupling cysteine (or other sulfhydryls) to amino groups, e.g. as disclosed by Kitagawa et 
al., J. Biochem. (Tokyo), Vol. 79, pgs. 233-239 (1976), and Lerner et al. (cited above). A general rule 
for selecting an appropriate method for coupling a given peptide to a protein carrier can be stated as 
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follows: the group involved in attachment should occur only once in the sequence, preferably at the 
appropriate end of the segment For example, BDB should not be used if a tyrosine residue occurs in 
the main part of a sequence chosen for its potentially antigenic character. Similarly, centrally located 
lysines rule out the glutaraldehyde method, and the occurrences of aspartic and glutamic acids 
frequently exclude the carbodiimide approach. On the other hand, suitable residues can be positioned 
at either end of chosen sequence segment as attachment sites, whether or not they occur in the 
"native" protein sequence. Internal segments, unlike the amino and carboxy tercnini, will differ 
significantly at the "unattached end" from the same sequence as it is found in the native protein where 
the polypeptide backbone is continuous. The problem can be remedied, to a degree, by acetylating the 
a-amino group and then attaching the peptide by way of its carboxy terminus. The coupling 
efficiency to the carrier protein is conveniently measured by using a radioactively labeled peptide, 
prepared either by using a radioactive amino acid for one step of the synthesis or by labeling the 
completed peptide by the iodination of a tyrosine residue. The presence of tyrosine in the peptide also 
allows one to set up a sensitive radioimmune assay, if desirable. Therefore, tyrosine can be 
introduced as a terminal residue if it is not part of the peptide sequence defined by the native 
polypeptide. 

Preferred carriers are proteins, and preferred protein carriers include bovine serum albumin, 

* - *- ■ - - ■ -. » . « . — — ,„ . . . 

linked to KLH through cysteines hy MBS as disclosed by Liu et aL, Biochemistry, Vol. 1 8, pgs. 690- 
697 (1979). The peptides are dissolved in phosphate-buffered saline (pH 7.5), 0.1 M sodium borate 
buffer (pH 9.0) or 1 .0 M sodium acetate buffer (pH 4.0). The pH for the dissolution of the peptide is 
chosen to optimize peptide solubility. The content of free cysteine for soluble peptides is determined 
by Ellman's method, Ellman, Arch. Biochem. Biophys., Vol. 82, pg. 7077 (1959). For each peptide, 4 
mg KLH in 0.25 ml of 10 mM sodium phosphate buffer (pH 7.2) is reacted with 0.7 mg MBS 
(dissolved in dimethyl formamide) and stirred for 30 mm at room temperature. The MBS is added 
dropwise to ensure that the local concentration of formamide is not too high, as KLH is insoluble in 
>30% forrnamide. The reaction product, KLH-MBS, is then passed through Sephadex G-25 
equilibrated with 50 mM sodium phosphate buffer (pH 6.0) to remove free MBS, KLH recovery from 
peak fractions of the column eluate (monitored by OD280) is estimated to be approximately 80%. 
KLH-MBS is then reacted with 5 mg peptide dissolved in 1 ml of the chosen buffer. The pH is 
adjusted to 7-7.5 and the reaction is stirred for 3 hr at room temperature. Coupling efficiency is 
monitored with radioactive peptide by dialysis of a sample of the conjugate against phosphate- 
buffered saline, and may range from 8% to 60%. Once the peptide-carrier conjugate is available, 
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polyclonal or monoclonal antibodies are produced by standard techniques, e.g. as disclosed by 
Campbell, Monoclonal Antibody Technology (Elsevier, New York, 1984); Hurrell, ed Monoclonal 
Hybridoma Antibodies: Techniques and Applications (CRC Press, Boca Raton, FL, 1982); Schreier 
et al. Hybridoma Techniques (Cold Spring Harbor Laboratory, New York, 1980); U.S. Patent 
5 4,562,003; or the like. In particular, U.S. Patent 4,562,003 is incorporated by reference. 

Humanized Antibodies 

Hie anti-CPP antibodies of the invention may further comprise humanized antibodies or 
human antibodies. Hie term "humanized antibody" refers to humanized forms of non-human (e.g., 

10 murine) antibodies that are chimeric antibodies, immunoglobulin chains or fragments thereof (such as 
Fv, Fab, Fab', F(ab'), or other antigen-binding partial sequences of antibodies) which contain some 
portion of the sequence derived from non-human antibody. Humanized antibodies include human 
immunoglobulins in which residues from a complementary determining region (CDR) of the human 
immunoglobulin are replaced by residues from a CDR of a non-human species such as mouse, rat or 

1 5 rabbit having the desired binding specificity, affinity and capacity. In general, the humanized antibody 
will comprise substantially all of at least one, and generally two, variable domains, in which all or 
substantially all of the CDR regions correspond to those of a non-human immunoglobulin and all or 

humanized antibody optimally also will comprise at least a portion ot an immunoglobulin constant 
20 region (Fc), typically that of a human immunoglobulin (Jones et al., Nature 32 1 :522-525 (1 986) and 
Presta, Curr. Op. Struct. Biol. 2:593-596 (1992)). Methods for humanizing non-human antibodies are 
well known in the art Generally, a humanized antibody has one or more amino acids introduced into 
it from a source which is non-human in order to more closely resemble a human antibody, while still 
retaining the original binding activity of the antibody. Methods for humanization of antibodies are 
25 further detailed in Jones et al., Nature 321:522-525 (1986); Riechmann et al., Nature 332:323-327 
(1988); and Verhoeyen et al., Science 239:1534-1536 (1988). Such "humanized" antibodies are 
chimeric antibodies in that substantially less than an intact human variable domain has been 
substituted by the corresponding sequence from a non-human species. 

3 0 Heteroconjugate Antibodies 

Heteroconjugate antibodies which comprise two covalently joined antibodies, are also within 
the scope of the present invention. Heteroconjugate antibodies may be prepared in vitro using known 
methods in synthetic protein chemistry, including those involving crosslinking agents. For example, 
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iinmunotoxms may be prepared using a disulfide exchange reaction or by forming a thioether bond. 
Bispecific Antibodies 

Bispecific antibodies have binding specificities for at least two different antigens. Such 
antibodies are monoclonal, and preferably human or humanized. One of the binding specificities of a 
bispecific antibody of $ie present invention is for a CPP, and the other one is preferably for a cell- 
surface protein or receptor or receptor sub unit Methods for making bispecific antibodies are known 
in the art, and in general, the recombinant production of bispecific antibodies is based on the co- 
expression of two immunoglobulin heavy-chain/light-chain pairs in hybridoma cells, where the two 
heavy chains have different specificities, e.g. Milstein and Cuello, Nature 305:537-539 (1983). Given 
that the random assortment of nnmimoglobuiin heavy and light chains results in production of 
potentially ten different antibody molecules by the hybridomas, purification of the correct molecule 
usually requires some sort of affinity purification, e.g. affinity chromatography. 

Uses of CPP antibodies 

CPP antibodies are preferably specific for the CPPs of the invention and, as such, do not bind 
peptides derived from other proteins with high affinity. As used herein, the term "heavy chain 

whose amino acid sequence corresponds to that of a heavy chain of an antibody of the invention, 
starting from the heavy chain's N-terminal amino acid. Likewise, the term "light chain variable 
region" means a polypeptide (1) which is from 95 to 1 1 5 amino acids in length, and (2) whose amino 
acid sequence corresponds to that of a light chain of an antibody of the invention, starting from the 
light chain's N-terminal amino acid. As used herein the term "monoclonal antibody" refers to 
homogeneous populations of immunoglobulins which are capable of specifically binding to CPPs. 

CPP antibodies may be used as functional modulators, preferably as antagonists. Preferably, 
antibody modulators of the invention are derived from monoclonal antibodies specific for CPPs. 
Monoclonal antibodies capable of blocking, or neutralizing, CPPs are selected by their ability to 
inhibit a CPP biological activity. 

The use of antibody fragments is also well known, e.g. Pab fragments: Tijssen, Practice and 
Theory of Enzyme Immunoassays (Elsevier, Amsterdam, 1985); and Fv fragments: Hochman et al. 
Biochemistry, VoL 12, pgs. 1 130-1 135 (1973), Sharon et al., Biochemistry, Vol. 15, pgs. 1591-1594 
(1976) and Ehriich et al., U.S. Patent 4,355,023; and antibody half molecules: Audftore- Hargreaves, 
US. Patent 4,470,925. 
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Preferably, monoclonal antibodies, Fv fragments, Fab fragments, or other binding 
compositions derived from monoclonal antibodies of the invention have a high affinity to CPPs. The 
affinity of monoclonal antibodies and related molecules to CPPs may be measured by conventional 
techniques including plasmon resonance, ELISA, or equilibrium dialysis. Affinity measurement by 
plasmon resonance techniques may be carried out, for example, using a BIAcore 2000 instrument 
(Biacore AB, Uppsala, Sweden) in accordance with the manufacturer's recommended protocol. 
Preferably, affinity is measured by ELISA, as described in ILS. patent 6,235,883, for example. 
Preferably, the dissociation constant between CPPs and monoclonal antibodies of the invention is less 
than 10" 5 molar. More preferably, such dissociation constant is less than 10" 8 molar; still more 
preferably, such dissociation constant is less than 10"* molar; and most preferably, such dissociation 
constant is in the range of 1 0" 9 to 10~ n molar. 

In addition, the antibodies of the present invention are useful for detecting CPPs. Such 
detection methods are advantageously applied to diagnosis of cardiovascular disorders, in particular, 
coronary artery disease. The antibodies of the invention may be used in most assays involving 
antigen-antibody reactions. The assays may be homogeneous or heterogeneous. In a homogeneous 
assay approach, the sample can be a biological sample or fluid such as serum, urine, whole blood, 
lymphatic fluid, plasma, saliva, cells, tissue, and material secreted by cells or tissues cultured in vitro. 

usually involves the specific antibody, a labeled analyte, and the sample suspected of containing toe 
antigen. The signal arising from the label is modified, directly or indirectly, upon the binding of the 
antibodjy to the labeled analyte. Both immunological reaction and detection of the extent thereof axe 
carried out in a homogeneous solution. Immunochemical labels which may be employed include free 
radicals, fluorescent dyes, enzymes, bacteriophages, coenzymes, and so forth. 

hi a heterogeneous assay approach, the reagents are usually the sample, the specific antibody, 
and means for producing a detectable signal. The specimen is generally placed on a support, such as a 
plate or a slide, and contacted with the antibody in a liquid phase. The support is then separated from 
the liquid phase and either the support phase or the liquid phase is examined for a detectable signal 
employing means for producing such signal or signal producing system. The signal is related to the 
presence of the antigen in the sample. Means for producing a detectable signal includes the use of 
radioactive labels, fluorescent compounds, enzymes, and so forth. Exemplary heterogeneous 
immunoassays are the radioimmunoassay, immunofluorescence methods, enzyme-linked 
immunoassays, and the like. 

For a more detailed discussion of the above immunoassay techniques, see "Enzyme- 
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Immunoassay," by Edward T. Maggio, CRC Press, Inc., Boca Raton, Fla., 1980, See also, for 
example, U.S. Pat Nos. 3,690,834; 3,791,932; 3,817,837; 3,850,578; 3,853,987; 3,867,517; 
3,901,654; 3,935,074; 3,984,533; 3,966,345; and 4,098,876, which listing is not intended to be 
exhaustive. Methods for conjugating labels to antibodies and antibody fragments are well known in 
the art Such methods may be found in U.S. Pat Nos. 4,220,450; 4,235,869; 3,935,974; and 
3,966,345. Another example of a technique in which the antibodies of the invention may be employed 
is immunoperoxidase labeling. (Sternberger, Irmnunocytochemistry (1979) pp. 104-169). 
Alternatively, the antibodies may be bound to a radioactive material or to a drug to form a 
radiopharmaceutical or pharmaceutical, respectively. {Carrasquillo, et at, Cancer Treatment Reports 
(1984)68:317-328). 

One embodiment of an assay employing an antibody of fee present invention involves the use 
of a surface to which the monoclonal antibody of the invention is attached. The underlying structure 
of the surface may take different forms, have different compositions and may be a mixture of 
compositions or laminates or combinations thereof. The surface may assume a variety of shapes and 
forms and may have varied dimensions, depending on the manner of use and measurement Illustrative 
surfaces may be pads, beads, discs, or strips which may be flat, concave or convex. Thickness is not 
critical, generally being from about 0.1 to 2 mm thick and of any convenient diameter or other 



cuvette and will typically be porous and polyfunctional or capable of being poiyfunctionahzed so as 
20 to permit covalent binding of an antibody and permit bonding of other compounds winch form a part 
of a means for producing a detectable signal. A wide variety of organic and inorganic polymers, bom 
natural and synthetic, and combinations thereof, may be employed as the material for the solid 
surface. Illustrative polymers include polyethylene, polypropylene, poly(4-methyIbutene), 
polystyrene, polymethracrylate, poly(ethylene terephthalate), rayon, nylon, poly( vinyl butyrate), 
25 silicones, polyformaldehyde, cellulose, cellulose acetate, nitrocellulose, and latex. Other surfaces 

include paper, glasses, ceramics, metals, metaloids, semiconductor materials, cements, silicates or the 
like. Also included are substrates that form gels, gelatins, lipopolysaccharides, silicates, agarose and 
polyacrylamides or polymers which form several aqueous phases such as dextrans, polyalkylene 
glycols (alkylene of 2 to 3 carbon atoms) or surfactants such as phospholipids. The binding of the 
30 antibody to the surface may be accomplished by well known techniques, commonly available in the 
literature (see, for example, "Immobilized Enzymes," Ichiro Chibata, Press, New York (1978) and 
Cuatrecasas, J. Bio. Chem., 245: 3059 (1970)). In carrying out the assay in accordance with this 
aspect of the invention, the sample is mixed with aqueous medium and the medium is contacted with 
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the surface having an antibody bound thereto. Labels may be included in the aqueous medium, either 
concurrently or added subsequently so as to provide a detectable signal associated with the surface. 
The means for producing the detectable signal can involve the incorporation of a labeled analyte or it 
may involve the use of a second monoclonal antibody having a label conjugated thereto. Separation 
and washing steps will be carried out as needed. The signal detected is related to the presence of CPP 
in the sample. It is within the scope of the present invention to include a calibration on the same 
support A particular embodiment of an assay in accordance with the present invention, by way of 
illustration and not limitation, involves die use of a support such as a slide or a well of a petri dish. 
The technique involves fixing the sample to be analyzed on the support with an appropriate fixing 
material and incubating the sample on the slide with a monoclonal antibody. After washing with an 
appropriate buffer such as, for example, phosphate buffered saline, the support is contacted with a 
labeled specific binding partner for the antibody. After incubation as desired, the slide is washed a 
second time with an aqueous buffer and the determination is made of the binding of the labeled 
monoclonal antibody to the antigen. If the label is fluorescent, the slide may be covered with a 
fluorescent antibody mounting fluid on a cover slip and then examined with a fluorescent microscope 
to determine the extent of binding. On the other hand, the label can be an enzyme conjugated to the 
monoclonal antibody and the extent of binding can be determined by examining the slide for the 



particular example or an assay utilizing the present antibodies is a double determinant kllsa assay. 
A support such as, e.g., a glass or vinyl plate, is coated with an antibody specific for CPP by 
conventional techniques. The support is contacted with the sample suspected of containing CPP, 
usually in aqueous medium. After an incubation period from 30 seconds to 12 hours, the support is 
separated from the medium, washed to remove unbound CPP with, for example, water or an aqueous 
buffered medium, and contacted with an antibody specific for CPP, again usually in aqueous medium. 
The antibody is labeled with an enzyme directly or indirectly such as, e.g., horseradish peroxidase or 
alkaline phosphatase. After incubation, the support is separated from the medium, and washed as 
above. The enzyme activity of the support or the aqueous medium is determined. This enzyme activity 
is related to the amount of CPP in the sample. 

The invention also includes kits, e.g., diagnostic assay kits, for carrying out the methods 
disclosed above. In one embodiment, the kit comprises in packaged combination (a) a monoclonal 
antibody more specifically defined above and (b) a conjugate of a specific binding partner for the 
above monoclonal antibody and a label capable of producing a detectable signal. The reagents may 
also include ancillary agents such as buffering agents and protein stabilizing agents, e.g., 
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polysaccharides and the like. The kit may further include, where necessary, other members of the 
signal producing system of which system the label is a member, agents for reducing background 
interference in a test, control reagents, apparatus for conducting a test, and the like. In another 
embodiment, the diagnostic kit comprises a conjugate of monoclonal antibody of the invention and a 
label capable of producing a detectable signal. Ancillary agents as mentioned above may also be 
present. 

Further, an anti-CPP antibody (e.g., monoclonal antibody) can be used to isolate CPPs by 
standard techniques, such as affinity chromatography or immunoprecipitation. For example, an anti- 
CPP antibody can facilitate the purification of natural CPPs from cells and of recombinantly produced 
CPP expressed in host cells. Moreover, an anti-CPP antibody can be used to isolate CPP to aid in 
detection of low concentrations of CPP (e.g., in plasma, cellular lysate or cell supernatant) or in order 
to evaluate the abundance and pattern of expression of the CPP. Anti-CPP antibodies can be used 
diagnostically to monitor protein levels in tissue as part of a clinical testing procedure, e.g., to 
determine the efficacy of a given treatment regimen. Detection can be facilitated by coupling (i.e., 
physically linking) the antibody to a label group. 

Protein Arrays 

accomplished using retentate chromatography (preferably, protein arrays or chips), as described by 
U.S. Patent 6225027 and U.S. Patent Application 20010014461, disclosures of which are herein 
incorporated by reference in their entireties. Briefly, retentate chromatography describes methods in 
which polypeptides (and/ or other sample components) are retained on an adsorbent (e.g., array or 
chip) and subsequently detected. Such methods involve (l) selectively adsorbing polypeptides from a 
sample to a substrate under a plurality of different adsorbent/ eluant combinations ("selectivity 
conditions") and (2) detecting the retention of adsorbed polypeptides by desorption spectrom e try (e.g., 
by mass spectrometry). In conventional chromatographic methods, polypeptides are eluted off of the 
adsorbent prior to detection. Hie coupling of adsorption chromatography with detection by 
desorption spectrometry provides extraordinary sensitivity, the ability to rapidly analyze retained 
components with a variety of different selectivity conditions, and parallel processing of components 
adsorbed to different sites (i.e., "affinity sites 0 or "spots") on the array under different elution 
conditions. 

These methods are useful for: combinatorial, biochemical separation and purification of the 
CPPs; study of differential gene expression; detection of differences in protein levels (e.g., for 
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diagnosis); and detection of molecular recognition events (e.g., for screening and drug discovery). 
Thus, this invention provides a molecular discovery and diagnostic device that is characterized by the 
inclusion of both parallel and multiplex polypeptide processing capabilities. Polypeptides of the 
invention and CPP-binding substances are preferably attached to a label group, and thus directly 
detected, enabling simultaneous transmission of two or more signals from the same "circuit 11 (i.e., 
addressable "chip" location) during a single unit operation. 

Detection of CPPs by mass spectrometry 

In accordance with the present invention, any instrument, method, process, etc. can be utilized 
to determine the identity and abundance of proteins in a sample. A preferred method of obtaining 
identity is by mass spectrometry, where protein molecules in a sample are ionized and then the 
resultant mass and charge of the protein ions are detected and determined. 

To use mass spectrometry to analyze proteins, it is preferred that the protein be converted to a 
gas-ion phase. Various methods of protein ionization are useful, including, e.g., fast ion bombardment 
(FAB), plasma desorption, laser desorption, thermal desorption, preferably, electrospray ionization 
(ESI) and matrix-assisted laser desorption/ionization (MALDI). Many different mass analyzers are 
available for peptide and protein analysis, including, but not limited to, Tune-of-Flight (TOF), ion 

i~^/TT\ TC\ T7^,,^_ ,_r* : — N /» - < , rr^m*, m\ j „t_ ; „ t _ f , _ fc . _ , . 

magnetic; speedometers, oee, e.g., t>.o. rat. ivo. Dp /z,023 tor an ion trap Ms. Mass analyzers can be 
used alone, or in combination with other mass analyzers in tandem mass spectrometers, hi the latter 
case, a first mass analyzer can be use to separate the protein ions (precursor ion) from each other and 
determine the molecular weights of die various protein constituents in the sample. A second mass 
analyzer can be used to analyze each separated constituents, e.g., by fragmenting the precursor ions 
into product ions by using, e.g. an inert gas. Any desired combination of mass analyzers can be used, 
including, e.g., triple quadrupoles, tandem time-of-flights, ion traps, and/or combinations thereof. 

Different kinds of detectors can be used to detect the protein ions. For example, destructive 
detectors can be utilized, such as ion electron multipliers or cryogenic detectors (e.g., U.S. Pat. No. 
5,640,010). Additionally, non-destructive detectors can be used, such as ion traps which are used as 
ion current pick-up devices in quadrupole ion trap mass analyzers or FTMS. 

For MALDI-TOF, a number of sample preparation methods can be utilized including, dried 
droplet (Karasand Hillenkamp, Anal. Chem., 60:2299-2301, 1988), vacuum-drying (Winberger et al., 
In Proceedings of the 41st ASMS Conference on Mass Spectrometry and Allied Topics, San 
Francisco, May 31-June 4, 1993, pp. 775a-b), crush crystals (Xiang et aL, Rapid Comm. Mass 
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Spectrom., 8:199-204,1994), slow crystal growing (Xiang et al., Org. Mass Spectrom, 28:1424-1429, 
1993); active film (Mock et al., Rapid Comm. Mass SpectronL,6:233-238, 1992; Bai et al., Anal. 
Chem, 66:3423-3430, 1994), pneumatic spray (Kochling et al., Proceedings of the 43rd ASMS 
Conference on Mass Spectrometry and Allied Topics; Atlanta, GA, May 21-26, 1995, pl225); 
5 electrospray (Hensel et al., Proceedings of the 43rd ASMS Conference on Mass Spectrometry and 
Allied Topics; Atlanta, GA, May 21 -26, 1995, p947); fast solvent evaporation (Vorm et al., 
Anal. Chem., 66:3281-3287, 1994); sandwich (Li et aL, J. Am. Chem. Soc, 1 1 8:1 1662-1 1663,1996); 
and two-layer methods (Dal et al., Anal. Chem., 71:1087-1091, 1999). See also, e.g., Liang et al., 
Rapid Commim. Mass Spectrom., 10: 1219-1226, 1996; van Adrichemet al., Anal. Chem., 70:923- 
10 930, 1998. 

For MALDI analysis, samples are prepared as solid-state co-crystals or thin films by mixing 
them with an energy absorbing compound or colloid (the matrix) in the liquid phase, and ultimately 
drying the solution to the solid state upon the surface of an inert probe. Jh some cases an energy 
absorbing molecule (EAM) is an integral component of the sample presenting surface. Regardless of 
15 EAM application strategy, the probe contents are allowed to dry to the solid state prior to introduction 
into the laser desorption/ionization time-of-flight mass spectrometer (LDIMS). 

Ion detection in TOF mass spectrometry is typically achieved with the use of electro-emissive 

function by converting primary incident charged particles into a cascade ot secondary, tertiary, 
20 quaternary, etc. electrons. The probability of secondary electrons being generated by the impact of a 
single incident charged particle can be taken to be the ion-to-electron conversion efficiency of this 
charged particle (or more simply, the conversion efficiency). The total electron yield for cascading 
events when compared to the total number of incident charged particles is typically described as the 
detector gain. Because generally the overall response time of MCPs is for superior to that of EMPs, 
25 MCPs are the preferred electro-emissive detector for enhancing mass/charge resolving power. 

However, EMPs function well for detecting ion populations of disbursed kinetic energies, where rapid 
response time and broad frequency bandwidth are not necessary. 

In a preferred aspect, for the analysis of digested proteins, a liquid-chromatography tandem 
mass spectrometer (LC-TMS) is used. This system provides an additional stage of sample separation 
30 via use of a liquid chromatograph followed by tandem mass spectrometry. 

In preferred aspects, a protein eluted from a column according to the system described in 
Example 1 is analyzed using both MS and MS-MS analysis. For example, a small portion of intact 
proteins eluting from RP2 may be diverted to online detection using LC-ESI MS. The proteins are 
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ali quoted on a number of plates allowing digestion or not with trypsin, preparation for MALDI-MS as 
well as for ESI-MS, as well as preparation of the MALDI plates with different matrices. The methods 
thus allow, in addition to information on intact mass, to conduct an analysis by both peptide mass 
fingerprinting and MS-MS techniques. 

The methods described herein of separating and fractionating proteins provide individual ~ 
proteins or fractions containing small numbers of distinct proteins. These proteins can be identified 
by mass spectral determination of the molecular masses of the protein and peptides resulting from the 
fragmentation thereof. Making use of available information in protein sequence databases, a 
comparison can be made between proteolytic peptide mass patterns generated in silico, and 
experimentally observed peptide masses. A "hit«list n can be compiled, ranking candidate proteins in 
the database, based on (among other criteria) the number of matches between the theoretical and 
experimental proteolytic fragments. Several Web sites are accessible that provide software for protein 
identification on-line, based on peptide mapping and sequence database search strategies (e.g., 
http://www.expasy.ch). Methods of peptide mapping and sequencing using MS are described in WO 
95/252819, U.S. Pat. No. 5,538,897, U.S. Pat. No. 5,869,240, U.S. Pat No. 5,572,259, and U.S. Pat 
No. 5,696,376. See, also, Yates, J. Mass Spec., 33:1 (1998). 

Data collected from a mass spectrometer typically comprises the intensity and mass to charge 

^p-aphical, numerical, or electronic formats, either in digital or analog form. Spectra are preferably 
recorded in a storage medium, including, e.g., magnetic, such as floppy disk, tape, or hard disk; 
optical, such as CD-ROM or laser-disc; or, ROM-CHIPS. 

The mass spectrum of a given sample typically provides information on protein intensity, 
mass to charge ratio, and molecular weight In preferred embodiments of the invention, the molecular 
weights of proteins in the sample are used as a matching criterion to query a database. The molecular 
weights are calculated conventionally, e.g., by subtracting the mass of the ionizing proton for singly- 
charged protonated molecular ions, by multiplying the measured mass/charge ratio by the number of 
charges for multiply-charged ions and subtracting the number of ionizing protons. 

Various databases are useful in accordance with the present invention. Useful databases 
include, databases containing genomic sequences, expressed gene sequences, and/or expressed protein 
sequences. Preferred databases contain nucleotide sequence-derived molecular masses of proteins 
present in a known organism, organ, tissue, or cell-type. There are a number of algorithms to identify 
open reading frames (ORF) and convert nucleotide sequences into protein sequence and molecular 
weight information. Several publicly accessible databases are available, including, the 
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SwissPROT/TrEMBL database (http://www.expasy.ch). 

Typically, a mass spectrometer is equipped with commercial software that identifies peaks 
above a certain threshold level, calculates mass, charge, and intensity of detected ions. Correlating 
molecular weight with a given output peak can be accomplished directly from the spectral data, i.e., 
where the charge on an ion is one and the molecular weight is therefore equal to the numerator value 
minus the mass of the ionizing proton. However, protein ions can be complexed with various counter- 
ions and adducts, such as N, C, and K\ hi such a case, it would be expected that a given protein ion 
would exhibit multiple peaks, such as a triplet, representing different ionic states (or species) of the 
same protein. Thus, it may be necessary to analyze and process spectral data to determine families of 
peaks arising from the same protein. This analysis can be carried out conventionally, e.g., as described 
by Mann et al., anal. Chem., 61:1702-1708 (1989). 

In matching a molecular mass calculated from a mass spectrometer to a molecular mass 
predicted from a database, such as a genomic or expressed gene database, post-translation processing 
may have to be considered. There are various processing events which modify protein structure, 
including, proteolytic processing, removal of N-tenninai methionine, acetylation, methyl ation, 
glycosylation, phosphorylation, etc. 

A database can be queried for a range of proteins matching the molecular mass of the 

which the sample was prepared, «tc. tsasea on tne namoer ox nits (^watrc a nit is matcnj m uie 
spectrum, the unknown protein or peptide is identified or classified- 
Methods of identifying one or more CPP by mass spectrometry are useful for diagnosis and 
prognosis of cardiovascular disorders. Preferably, such methods are used to detect one or more CPP 
present in human plasma. Exemplary techniques are described in U.S. Patent Applications 
02/0060290, 02/0137106, 02/0138208, 02/0142343, 02/0155509, disclosures of which are 
incorporated by reference in their entireties. 

Diagnostic and Prognostic Uses 

The nucleic acid molecules, proteins, protein homologues, and antibodies described herein 
can be used in one or more of die following methods: diagnostic assays, prognostic assays, monitoring 
clinical trials, drug screening, and pharmacogenetics as further described herein. 

The invention provides diagnostic and prognostic assays for detecting CPP nucleic acids and 
proteins, as further described. Also provided are diagnostic and prognostic assays for detecting 
interactions between CPPs and CPP target molecules, particularly natural agonists and antagonists. 
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The present invention provides methods for identifying polypeptides that are differentially 
expressed between two or more samples. "Differential expression" refers to differences in the quantity 
or quality of a polypeptide between samples. Such differences could result at any stage of protein 
expression from transcription through post-translational modification. For example, using protein 
array methods, two samples are bound to affinity spots on different sets of adsorbents (e.g., chips) and 
recognition maps are compared to identify polypeptides that are differentially retained by the two sets 
of adsorbents. Differential retention includes quantitative retention as well as qualitative differences 
in the polypeptide. For example, differences in post-translational modification of a protein can result 
in differences in recognition maps detectable as differences in binding characteristics (e.g., 
glycosylated proteins bind differently to lectin adsorbents) or differences in mass (e.g., post- 
translational cleavage products). In certain embodiments, an adsorbent can have an array of affinity 
spots selected for a combination of markers diagnostic for a disease or syndrome. 

Differences in polypeptide levels between samples (e.g., differentially expressed CPPs in 
plasma samples) can be identified by exposing die samples to a variety of conditions for analysis by 
desorption spectrometry (e.g., mass spectrometry). Unknown proteins can be identified by detecting 
physicocherxrical characteristics (e.g., molecular mass), and this information can be used to search 
databases for proteins having similar profiles. 

provide information about tee size and cuaracter or me particular t isoionn mat is present in a 
sample, e.g., a biological sample submitted for diagnosis or prognosis. Mass spectrometry techniques 
are detailed in the section titled 'Detection of CPPs by mass spectrometry". Example 1 outlines a 
preferred detection scheme, wherein a biological sample is separated by chromatography before 
characterization by mass spectrometry. The invention provides a method of detecting a CPP in a 
biological sample comprising the steps of: fractionating a biological sample {e.g., plasma, serum, 
lymph, cerebrospinal fluid, cell lysate of a particular tissue) by at least one chromatographic step; 
subjecting a fraction to mass spectrometry; and comparing the characteristics of polypeptide species 
observed in mass spectrometry with known characteristics of CPP polypeptides (e.g., CPP 12, CPP 
15, CPP 16, and CPP 18, as disclosed in Table 1). 

The isolated nucleic acid molecules of the invention can be used, for example, to detect CPP 
mRNA (e.g., in a biological sample) or a genetic alteration in a CPP-encoding gene, and to modulate a 
CPP activity, as described further below. In addition, the CPPs can be used to screen for naturally 
occurring CPP target molecules, and to screen for drugs or compounds which modulate CPP activity. 
Moreover, the anti- CPP antibodies of the invention can be used to detect and isolate CPPs, regulate 
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the bioavailability of CPPs, and modulate CPP activity. 

Accordingly one embodiment of fee present invention involves a method of use wherein a 
molecule of the present invention (e.g., a CPP, CPP nucleic acid, CPP modulator, or antibody) is used, 
for example, to diagnose and/or prognose a disorder in which any of the aforementioned CPP 
activities is indicated. In another embodiment, the present invention involves a method of use wherein 
a molecule of the present invention is used, for example, for the diagnosis and/or prognosis of 
subjects, preferably a human subject, in which any of the aforementioned activities is pathologically 
perturbed. 

For example, the invention encompasses a method of determining whether a CPP is expressed 
within a biological sample comprising: a) contacting said biological sample with: i) a polynucleotide 
that hybridizes under stringent conditions to a CPP nucleic acid; or ii) a detectable polypeptide (e.g. 
antibody) that selectively binds to a CPP; and b) detecting the presence or absence of hybridization 
between said polynucleotide and an RNA species within said sample, or the presence or absence of 
binding of said detectable polypeptide to a polypeptide within said sample. Detection of said 
hybridization or of said binding indicates that said CPP is expressed within said sample. Preferably, 
the polynucleotide is a primer, wherein said hybridization is detected by detecting the presence of an 
amplification product comprising said primer sequence, or the detectable polypeptide is an antibody. 

iwauuwii irwy (c^v, v.fr, ^j.^.j.m. twjfc, -,,<jaj,i73 ana h.oo j^uz, tne disclosures oi wiucii are 
incorporated herein by reference in their entireties), such as anchor PCR or RACE PCR, or, 
alternatively, in a ligation chain reaction (LCR) (see, e.g., Landegren et al. (1988) Science 241:1077- 
1080; and Nakazawa et al. (1994) PNAS 91:360-364, the disclosures of which are incorporated herein 
by reference in their entireties), the latter of which can be particularly useful for detecting point 
mutations in the CPP-encoding-gene (see Abravaya et al. (1995) Nucleic Acids Res. 23:675-682, the 
disclosure of which is incorporated herein by reference in its entirety). 

Also envisioned is a method of determining whether a mammal, preferably human, has an 
elevated or reduced level of expression of a CPP, comprising: a) providing a biological sample from 
said mammal; and b) comparing the amount of a CPP or of a CPP RNA species encoding a CPP 
within said biological sample with a level detected in or expected from a control sample. An increased 
amount of said CPP or said CPP RNA species within said biological sample compared to said level 
detected in or expected from said control sample indicates that said mammal has an elevated level of 
CPP expression, and a decreased amount of said CPP or said CPP RNA species within said biological 
sample compared to said level detected in or expected from said control sample indicates that said 
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1 has a reduced level of expression of a CPP. 
The present invention also pertains to the field of predictive medicine in which diagnostic 
assays, prognostic assays, and monitoring clinical trials are used for prognostic purposes. 
Accordingly, one aspect of the present invention relates to diagnostic assays for determining CPP 
and/or nucleic acid expression as well as CPP activity, in the context of a biological sample (e.g., 
blood, plasma, cells, tissue) to thereby determine whether an individual is afflicted with a disease or 
disorder, or is at risk of developing a disorder, associated with aberrant CPP expression or activity. 
The invention also provides for prognostic (or predictive) assays for determining whether an 
individual is at risk of developing a disorder associated with a CPP, nucleic acid expression or 
activity. For example, mutations in a CPP-encoding gene can be assayed in a biological sample. Such 
assays can be used for prognostic or predictive purpose to thereby prophylactically treat an individual 
prior- to the onset of a disorder characterized by or associated with CPP expression or activity. 

The term "biological sample" is intended to include tissues, cells and biological fluids isolated 
from an individual, as well as tissues, cells and fluids present within an individual. That is, the 
detection methods of the invention can be used to detect a CPP mRNA, protein, or genomic DNA in a 
biological sample m vitro as well as in vivo. Preferred biological samples are biological fluids such as 
lymph, cerebrospinal fluid, blood, and especially blood plasma. For example, in vitro techniques for 



techniques lor <aetecuonoi &<J±*t mcmae mass spectrometry, tunzyme j_inicea immuno cs>oroem 
20 Assays (ELISAs), Western blots, immunoprecipitations and immunofluorescence. In vitro techniques 



for detection of a CPP-enc 
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ig genomic DNA include Southern hybridizations. Furthermore, in vivo 
techniques for detection of a CPP include introducing into an individual a labeled anti- CPP antibody. 

In preferred embodiments, the subject methods can be characterized by generally comprising 
detecting, in a tissue sample of the individual (e.g. a human patient), the presence or absence of a 
genetic lesion characterized by at least one of (i) a mutation of a gene encoding one of the subject 
CPP or (ii) the mis-expression of a CPP-encoding gene. To illustrate, such genetic lesions can be 
detected by ascertaining the existence of at least one of (i) a deletion of one or more nucleotides from 
the CPP-encoding gene, (ii) an addition of one or more nucleotides to the gene, (iii) a substitution of 
one or more nucleotides of the gene, (iv) a gross chromosomal rearrangement or anxplification of the 
gene, (v) a gross alteration in the level of a messenger RNA transcript of the gene, (vi) aberrant 
modification of the gene, such as of the methylation pattern of the genomic DNA, (vii) the presence of 
a non-wild type splicing pattern of a messenger RNA transcript of the gene, and (viii) reduced level of 
expression, indicating lesion in regulatory element or reduced stability of a CPP-encoding transcript 
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In yet another exemplary embodiment, aberrant methyiation patterns of a CPP nucleic acid 
can be detected by digesting genomic DNA from a patient sample with one or more restriction 
endomicleases that are sensitive to methyiation and for which recognition sites exist in the CPP- 
encoding gene (including in the flanking and intronic sequences). See, for example, Buiting et al. 
(1994) Human Mol Genet 3:893-895. Digested DNA is separated by gel electrophoresis, and 
hybridized with probes derived from, for example, genomic or cDNA sequences. The methyiation 
status of the CPP-encoding gene can be determined by comparison of the restriction pattern generated 
from the sample DNA with that for a standard of known methyiation. 



In yet another eml 



•'•mid 



tent, a diagnostic assay is provided which detects the ability of a CPP 
to bind to a cell surface or extracellular protein. For instance, it will be desirable to detect CPP 
mutants which, while expressed at appreciable levels in the cell, are defective at binding a CPP target 
protein (having either diminished or enhanced binding affinity for the target). Such mutants may arise, 
for example, from mutations, e.g., point mutants, which may be impractical to detect by the diagnostic 
DNA sequencing techniques or by the immunoassays described above. The present invention 
accordingly further contemplates diagnostic screening assays which generally con ipnse cloning one 
or more CPP-encoding gene from the sample tissue, and expressing the cloned genes under conditions 
which permit detection of an interaction between that recombinant gene product and a target protein. 



variety of techniques can be used to determine the ability ot a Ctt to bind to other components. 
These techniques can be used to detect mutations in a CPP-encoding gene which give rise to mutant 
proteins with a higher or lower binding affinity for a CPP target protein relative to the wild-type CPP. 
Conversely, by switching which of the CPP target protein and CPP is die "bait" and which is derived 
from die patient sample, the subject assay can also be used to detect CPP target protein mutants which 
have a higher or lower binding affinity for a CPP relative to a wild type form of that CPP target 
protein. 

In an exemplary embodiment, a target protein can be provided as an immobilized protein (a 
"target"), such as by use of GST fusion proteins and glutathione treated microliter plates as described 
herein. 

In another embodiment, the methods further involve obtaining a control biological sample 
from a control subject, contacting the control sample with a compound or agent capable of detecting a 
CPP, mRNA, or genomic DNA, such that the level of a CPP, rriRNA or genomic DNA is measured in 
the biological sample, and comparing the level of the CPP, mRNA or genomic DNA in die control 
sample to the level in the test sample. The invention also encompasses kits for detecting the presence 
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of a CPP, mRNA or genomic DNA in a biological sample. For example, the kit can comprise: a 
labeled compound or agent capable of detecting a CPP, mRNA or genomic DNA in a biological 
sample; means for dete rm i nin g the amount of a CPP in the sample; and means for comparing the 
amount of CPP in the sample with a standard. The compound or agent can be packaged in a suitable 
container. The kit can further comprise instructions for using the kit to detect CPP or nucleic acid. 

CPPs clusters 

hi one aspect of the invention, methods for the diagnosis of cardiovascular disorders comprise 
detecting in a test biological sample the presence or level of one or more CPP of the invention in 
combination with the detection of other Cardiovascular disorder Plasma Polypeptides (CPPs). 
Particularly preferred other CPPs for use in the diagnosis of cardiovascular disorders in combination 
with CPPs of the invention are listed in Table 2. 
Table 2 



Table 2 


CPP# 


CfcX 


RM 


Tryptfc Sequences (RP2) 


CPP 2 


9 


9 


TIVGSITNTNFGICHDAGR (10) 


CPP 2 


10 


9 


CTSMASENSECSVK (13-14). SNCCQHSSALGLAR (13-14) 


CPP 8 


18 


6 


CLDPVDTPKPTR (7-8). YKKPECQSDWQCPGK (8) 










CPP 8 


18 


13 


CLDPVDTPNPTR (3) 


CPP 9 


2 


8 


DPPQYPWPVHLDR (22), RDPPQYPWPVHLDR (22-23). YAQTPANMFYJVACDNR (19- 
24) 


CPP 9 


3 


10 


YAQTPANMFYTVACDNR (13) 


CPP 9 


3 


11 


YAQTPANMFYIVACDNR (14-15) . 


CPP 9 


4 


10 


RDPPQYPWPVHLDR (12), YAQTPANMFYIVACDNR (12-13) 


CPP 9 


5 


11 


YAQTPANMFYTVACDNR ( 13 ) 


CPP 9 


8 


12 


YAQTPANMFYIVACDNR (7) 


CPP 9 


9 


11 


YAQTPANMFYtVACONR (10, 13) 


CPP 9 


9 


12 


j RDPPQYPWPVHLDR (8), YAQTPANMFYIVACDNR (8-9) 


CPP 13 


13 


14 


VSAQQVQGVHAR (9, 12) 


CPP 13 


13 


18 


FPVYDYDPSSLR (5). VNSQSLSPYLFR (5-6) 


CPP 13 


13 


19 


DYWSTAVCR (5-6), FPVYDYDPSSLR (6), VSAQQVQGVHAR (6) 


CPP 13 


13 


20 


DYYVSTAVCR (4-5), FPVYDYDPSSLR (5), VNSQSLSPYLFR (4-5), VSAQQVQGVHAR (5) 


CPP 13 


13 


21 


DYYVSTAVCR (5), VNSQSLSPYLFR (5) 


CPP 13 


13 


22 


DYYVSTAVCR (10), FPVYDYDPSSLR (3), VNSQSLSPYLFR (3-4) 


CPP 13 


13 


23 


VNSQSLSPYLFR (3) 


CPP 13 


13 


25 


DYYVSTAVCR (1), VNSQSLSPYLFR (1) 


CPP 13 


14 




DALSASWK (15), DSGEDPATCAFQR (15), FPVYDYDPSSLR (15), VNSQSLSPYLFR 
(14X VSAQQVQGVHAR (15) 
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Table 2 


rpp g 


f»f7Y 


rvr 1 


Tiyptic Sequences (RP2) 


vrr !*"> 


♦4 




UoiafcUHAIUAMjiK (ID), VN5Q5LSPYLFR (10) 


pop <i 






V OnUUV UO V nAK {/) 


CPP 13 


14 


£. 1 


VOMVJUVUu VnAK {D, / , OJ 


CPP 13 


14 




" VWL<ir»55l QDVf PO "" " — — — 1 
viwuoLoriLrn 


CPP 13 


14 


25 


VOnyuvUuVnnn {^J 


CPP 13 


15 






CPP 13 


15 


15 


vtNOUdLOrTLrn l« «/ 


CPP 1*? 


1ft 




vaHuuvui)vri>u*\ ^jj 


CPP 14 


15 


A 
*r 


Vj7 voLJvr ioAoLrK ( TO) 


^PP 14 


1 o 






pop 55 


1/ 


9 


ovbLRPlGASCRDDStCI IVc (9-1 U) 


CPP 14 


18 


7 


GVSLRPIGASCR (9) 


CPP 17 


A 

~ 


8 




CPP 17 




9 


IPACIAGER (11, 16), YGTCIYQGR (12) 


' pop 47 


5 


13 


YGTCIYQGR (7) 


CPP 17 


6 


8 


IPACIAGER (9), YGTCIYQGR (9-10) " " 




6 


9 


fPACIAGER (9), YGTCIYQGR (10-11) 


CPP 17 


7 


7 


IPACIAGER (13-15), YGTCfYGGR (12-15) 




7 


8 


IPAC1AGER (8-12), RYGTCJYQGR (9), YGTCIYQGR (8-9, 11,12) 


CPP 17 


7 


9 


IPAC1AGER (8-1 1 ). YGTCIYQGR (8-1 1) 


Urr 1 ( 


7 


10 


IPACIAGER (7), YGTCIYQGR (7-8) 


i j 








8 


8 


IPACIAGER (8, 11, 12, 16). IPACIAGERR (8), LWAFCC (8-9, 11, 12), RYGTCIYQGR (8-9), 
YGTCIYQGR (7-13), YGTC3YQGRLWAFCC (8) 




8 


9 


IPACIAGER (7-14, 16. 17), IPACIAGERR (11), LWAFCC (9-10. 12), RYGTCIYQGR (8, 10, 
11, 12, 14X YGTCIYQGR (7-14, 17. 18), YGTCIYQGRLWAFCC (8. 12) 


ODD 47 


8 


10 


IPACIAGER (6, 8, 9 t 10, 11. 13, 14), LWAFCC (6-9). RYGTCIYQGR (6-7), YGTCIYQGR (6- 
9, 11, 13. 14. 15. 16) 


CPP 17 


8 


11 


IPACIAGER (5-9, 11, 12), LWAFCC (6-8). RYGTCIYQGR (6). YGTCIYQGR (5-9, 14) 


CPP 17 


8 


12 


IPACIAGER (4-9. 12), YGTCIYQGR (4-9) 


CPP 17 


8 


13 


IPACIAGER (4-7). YGTCIYQGR (4-6) 


CPP 17 


8 


14 


ADEVAAAPEQ? AAD! P EVWSLAWDESLAPK (6), IPACIAGER (2-4). LWAFCC (3), 
YGTCIYQGR (2, 4) 


CPP 17 


8 


15 


IPACIAGER (3. 8). YGTCIYQGR {3) 


CPP 17 


9 


7 


IPACIAGER (11-13). LWAFCC (12). RYGTCIYQGR (13). YGTCIYQGR (10-14) 


CPP 17 


9 


8 


IPACIAGER (7-12), RYGTCIYQGR (8-9, 11). YGTCIYQGR (8) 


CPP 17 


9 


9 


IPACIAGER (8-11, 13, 14, 15, 17, 18, 19, 20), IPACIAGERR (8). LWAFCC (8-9, 11), 
RYGTCIYQGR (8-9). YGTCIYQGR (11, 13, 14) 


CPP 17 


9 


10 


IPACIAGER (8). LWAFCC (7), YGTCIYQGR (7-8) 


CPP 17 


9 


11 


IPACIAGER (8X LWAFCC (8), YGTCIYQGR (7-8) 


CPP 17 


9 


12 


ADEVAAAPEQiAADIPEVWSLAWDESLAPK (8). IPACIAGER (7, 10, 11), YGTCIYQGR (6, 
10, 11) 
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Table 2 


rpp a 


rev 


l\rl 


iiypxic Sequences tKKicj 




rv 

y 


14 


IPACIAvscK (10, 14), ToTCaYQGR (10, 12) 


Orr 1 / 


Q 

57 


1 A 


irAVHAotsf\ |Oj 


r5PP 17 


Q 

y 


1 A 


lrAvlftuCr\ \0-zf) 


vr r 1 f 


11/ 


7 
# 


•rAOlAocrt (i^-io, 4iJ, LWApuU ( lb/, YGTCJYQGK (lb, 21/ 






O 


» rMOWucn tO-57, 11, l^d, lO, lOJ, Y\j 1 Ol Y (*K?K (o-li J 


PPP 17 


ir> 

lu 


Q 

y 


TolV»lTVioK (sr-lUJ 


CPP 17 


To 


14 


1 rnWnVJC rv ^O/ 


CPP 17 


11 


7 


I rMWMOCr\ ^ 1 OJ, T OIV/lTUuK \10-1 f ) 


CPP 17 


11 


A 


IPACJAf^PR /Q.11 1 o\ ipAjOiAfiCoo *a\ vrrr*ivftOD ho\ ■ 


CPP 17 


11 


A 

<7 


I' M.v«#!r\vjtr\ 1=7- IvJ), T vj I YvJvaK (y) 


\Jt"r 1 # 


n 


1U 


iKALJIAfci&K (7) 


CPP 17 


11 


11 


YGTCIYQGR (7> " "' 


CPP 17 


11 


11 

1 w 


JPAOAfSPn MTV ~ " — ^— — — — 


CPP 17 


12 

t 




' i ANV* IM V3CTV H>-y) t LVVrtrtrO l*/t • o 1 Ol T UoK \# -yj 


CPP 17 


12 


q 


IPACJAftPR fR-0\ ~ — ^ ■ 


CPP 17 

\S1 W If 


12 • 


1D 




CPP 17 

V ■ Iff 


12 


12 


nucvnnnrCUInnUlrCVV VOLMVVUCoLArFS 


CPP 17 


12 
• 


14 


AnPVAAAPP/^lAAniPjrxAA/ci AW/ncci adlt /q\ 

/AiycvA\MM?^C\JI>\MlJlrnV VVoLAVVU (o/ 


CPP 17 


! 13 


R 


IPACIAAPR fQ 14\ l waftt vnrivrrD 4>i\ ■ 


CPP 17 


11 


q 

57 


IPAriAfiPR m 0 4 l »A/ACrr /n_ 4 n\ vrt/MVAPD tn ' ... 


CPP 17 


13 


10 


YGTCIYQGR (10) 


j 1 1 / 


1* 




IrMWWbtK ( YiJtt^rTUWJ'C (T3J 


1 rpp 17 


14 


57 1 


irMulMbcn (1U-1 1/, LWArCLr (10) 


rpp 17 


14 


in 


IrMUMutK (/J 


PPP 17 


14 1 




/U/cv/v\A^tlJI/V\UJI- , l:VVV£iLAWDt5LAPK (9-10) 


CPP 47 


14 






CPP 17 


1S 


57 


irAulAiafcK (o f 1Z, 14, 15, 17, IB), LWAFCC (9), YGTClYQGR (10, 15, 16) 


CPP 17 


IP 


O 


TljlLilYUvsK (0) 




o 


f 


wlooo Y r* I ViLADVK (1 0) 


CPP 1Q 


A 


7 
f 


A^PAU 1 UKFUUMK (10), MbbSYPi GLADVK (10), 
fwoof r* 1 ^LAL>viSAvsPAUTLl RPQDIVIK (1 0) 


VTiT £\r 


c 
D 


57 


Hrvj YrioiV (11/ 


] CPP 20 


10 




CEEDKEFTCR (9. 11) 


CPP 20 


10 


10 


CEEDKEFTCR (8), EPLDDYVNTQGPSLFSVTK (8) 


CPP 20 


11 


10 


CEEDKEFTCR (8), EPLDDYVNTQGPSLFSVTK (8) 


CPP 20 


11 


11 


EPLDDYVNTQGPSLFSVTK (8) 


CPP 40 


17 


20 


EDPTVSALLTSEK (9X VPSLVGSF3R (8-9) 


CPP 40 


17 


22 


VPSLVGSF1R (7) 


CPP 40 


18 


20 


VPSLVGSFIR (9) 


CPP 41 


12 


12 


LQNNENNJSCVER (9), STDTSCWPPTVQNAHJLSR (9) 


CPP 41 


14 


12 


TGESAEFVCK (9) 


CPP 41 


15 


11 


fTCTE EGWSPTPK (15-16). STDTSCVNPPTVQNAHILSR (15-16), TGESAEFVCKR (16) 
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Table 2 


CPP# 


CEX 


RP1 


Tryptic Sequences (RP2) 


CPP41 


16 


29 


YKPFSQVPTGEVFYYSCEYNFVSPSK (1) 


CPP 41 


17 


12 


CLHPCVISR <9), EIMENYNIALR (9), INHGILYDEEK (9), ITCTEEGWSPTPK (9). 
LQNNENNISCVER (9), SFWTRJTCTEEGWSPTPK (9), STDTSCVNPPTVQNAHILSR (9), 
TGESAEFVCKR (9), TTCWDGKLEYPTCAK (9) 


CPP 41 


17 


13 


CLHPCVISR (9), EATFCDFPK (9), EIMENYNIALR (9), GWSTPPK (9). INHGILYDEEK (9), 
ITCTEEGWSPTPK (9). LQNNENNISCVER (9), STDTSCVNPPTVQNAHILSR (9), 
TTCWDGKLEYPTCAK (9) 


CPP 41 


17 


14 


EIMENYNIALR (7), INHGILYDEEK (7), ITCTEEGWSPTPK (7). LQNNENNISCVER (7), 
STDTSCVNPPTVQNAHILSR (7), YKPFSQVPTGEVFYYSCEYNFVSPSK (7) 


CPP 41 


17 


16 


EATFCDFPK (5), EIMENYNIALR (5), LQNNENNISCVER (5), STDTSCVNPPTVQNAHILSR 
(5), TGESAEFVCK (5), TTCWDGKLEYPTCAK (5) 


CPP 41 


17 


18 


CLHPCVISR (4), EATFCDFPK (4), EIMENYNIALR (4), INHGILYDEEK (4), 
ITCTEEGWSPTPK (4), LEYPTCAK (4), SFWTR (4). STDTSCVNPFTVQNAKLSR (4). 
TTCWDGKLEYPTCAK \A) 


< 

CPP 41 


17 


20 


EATFCDFPK (3), EIMENYNIALR (3), INHGILYDEEK (3), ITCTEEGWSPTPK (3), 
LQN NENN 1 SCVER (3), STDTSCVNPPTVQNAHILSR (3), TGESAEFVCKR (3), 
TTCWDGKLEYPTCAK (3), YKPFSQVPTGEVFYYSCEYNFVSPSK (3) 


CPP 41 


17 


22 


EATFCDFPK (1). EIMENYNIALR (1), INHGILYDEEK (1), ITCTEEGWSPTPK (1). 
NGQWSEPPKCLHPCVISR<1), SFWTR ITCTEEGWSPTPK (1) r 
STDTSCVNPPTVQNAHILSR (1), TTCWDGKLEYPTCAK (1) 


* 






CLHPCVISR (1), EATFCDFPK (1), INHGILYDEEK (1), ITCTEEGWSPTPK (1) r 


CPP 41 


17 


26 


EATFCDFPK (1), EIMENYNIALR (1), GWSTPPK (1). INHGILYDEEK (1), 
ITCTEEGWSPTPK (1). STDTSCVNPPTVQNAHILSR <1), TGESAEFVCKR (1) 


CPP 41 


17 


27 


CLHPCVISR (1), EATFCDFPK (1), EIMENYNIALR (1). INHGILYDEEK (1), 
ITCTEEGWSPTPK (1). -LEYPTCAK (1), LQNNENNISCVER (1), SFWTR (1), 
STDTSCVNPPTVQNAHILSR (1) 


CPP 41 


17 


29 


CLHPCVISR (1), EATFCDFPK (1), EIMENYNIALR (1), INHGILYDEEK (1), 
ITCTEEGWSPTPK (1), LQNNENNISCVER (1), STDTSCVNPPTVQNAHILSR (1), 
TGESAEFVCK (1), TTCWDGKLEYPTCAK (1). YKPFSQVPTGEVFYYSCEYNFVSPSK (1) 


CPP 41 


17 


30 


CLHPCVISR (1 ). EIMENYNIALR (1 > f INHGILYDEEK (1 ). ITCTEEGWSPTPK <1 )> 
LQNNENNISCVER (1), STDTSCVNPPTVQNAHILSR (1), TTCWDGKLEYPTCAK <1) 


CPP 41 


18 


12 


CLHPCVISR (9-10), EATFCDFPK (9-10). EIMENYNIALR (9), INHGILYDEEK (9-10), 
ITCTEEGWSPTPK (9), LQNNENNISCVER (9-10). STDTSCVNPPTVQNAHILSR (9-10), 
TGESAEFVCKR (9). TTCWDGKLEYPTCAK (9-10) 


CPP 41 


18 


18 


EATFCDFPK (4). EIMENYNIALR (4), LQNNENNISCVER (4), STDTSCVNPPTVQNAHILSR 
(4). TTCWDGKLEYPTCAK <4) 


CPP 41 


18 


i 

19 


EATFCDFPK (4). EIMENYNIALR (4), INHGILYDEEK (4), ITCTEEGWSPTPK (4). 
LQNNENNISCVER (4). NGQWSEPPK (4), STDTSCVNPPTVQNAHILSR (4) 


CPP 41 


18 


20 


LQNNENNISCVER (3). STOTSCVNPPTVQNAH1LSR (3), TGESAEFVCKR (3). 
TTCWDGKLEYPTCAK (3) 


CPP 41 


18 


22 


EATFCDFPK (1). INHGILYDEEK (1), ITCTEEGWSPTPK (1), LQNNENNISCVER (1). 
STDTS CVKPPTVQNAHtLSR (1), TGESAEFVCKR (1). TTCWDGKLEYPTCAK (1) 
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Table 2 


CPP# 


CEX 


RP1 


Tryptic Sequence (RP2) 


CPP 41 


18 


26 


CLHPCV1SR (1X EATFCDFPK (1), EIMENYNIALR OX INHGILYDEEK (1), 
ITCTSGWSPTPK (1X STDTSCVNPPTVQNAHi LSR 0). TGESAEFVCK 0) 


CPP 41 


18 


29 


EATFCOFPK (1), EIMENYNIALR OX INHGILYDEEK (1). fTCTEEGWSKTPK OX 
LQNNENN1SCVER 0). STDTSCVKPPTVQNAHILSR 0). TTCWDGKLEYPTCAK (1) 


CPP 149 


13 


9 


AFTECCWASQLR (11), CCYDGACVNNDETCEQR (10) 


CPP 149 


A A 

14 


9 


AFTECCWASQLR (11), CCYDGACVNNDETCEQR (11) 


CPP 149 


15 


9 


AFTECCWASQLR (8), CCYDGACVNNDETCEQR (8) 


CPP 149 


15 


11 


At- ifcCCWASQLR (7) 


CPP 149 


16 


8 


Ah- 1 tUCVVASQLR (8-9) 


CPP 149 


16 


9 


AFTECCWASQLR (9) 


CPP 149 


17 


10 


AFTECCWASQLR (7). CCYDGACVNNDETCEQR (7) 


CPP 150 


14 


22 


TNFDNDIALVR (12) 


CPP 150 


14 


23 


SNALDI1 FQTDLTGQK (11), SSNNPHSPIVEEFQVPYNK <1 1), TNFDNDIALVR (10-11), 
VEDPESTLFGSV1R (8) 


CPP 150 


14 


24 


DWQITCLDGFEWEGR (12X EDTPN S VWEPAK (9). QFGPYCGHGFPGPLNIETK (8-12), 
SNALDIIFQTDLTGQK (8-9, 12), SSNNPHSPIVEEFQVPYNK <9), TNFDNDIALVR (8-12) 


CPP 150 


14 


25 


QFGPYCGHGFPGPLNIE IK (9), SNALD3IFQTDLTGQK (5), TNFDNDIALVR (5, 9), 
VEDPESTLFGSVIR (9) 


CPP 150 


14 


26 


GDSGGAFAVQDPNDK (8), SNALDIIFQTDLTGQK (7-S). TNFDNDIALVR (7) 


CPP 150 




14 


| 27 


GDSGGAFAVQDPNDK (8), QFGPYCGHGFPGPLNIETK (5). SNALDIIFQTDLTGQK (6. 8), 
TNFDNDIALVR (5, 7, 8) 


CPP 150 


14 


29 


10, 11, 12, 13, 14, 16, 17, 19, 20) 


CPP 150 


14 


30 


GDSGGAFAVQDPNDK (8), QFGPYCGHGFPGPLNIETK (11), TNFDNDIALVR (7) 


CPP 151 


10 


22 


SFEGLGQLEVLTLDHNQLQEVK (8) 


CPP 151 


12 


21 


SFEGLGQLEVLTLDHNQLQEVK (14) 


CPP 151 


13 


23 


LAELPADALGPLQR (12), LAYLQPALFSGLAELR (11), LEALPNSLLAPLGR (12). 
VAGLLEDTFPGLLGLR (1 1 -12) 


CPP 501 


2 i 


21 


EFLEDTCVQYVQK (7), TQSGLQSYLLQFHGLVR (7) 


CPP 501 


2 


22 


CFLGCELPPEGSR (6), EFLEDTCVQYVQK (6), TQSGLQSYLLQFHGLVR (4, 6) 


CPP 501 


2 


23 


EFLEDTCVQYVQK (6X TQSGLQSYLLQFHGLVR (5) 


ppp coo 


15 
i \j 


2 


ALNSHDVYHK (1) 


CPP 502 


17 


17 


ALNSIIDVYHK (9), GADVWFK (9) 


CPP 502 


17 


16 


ALNSilDVYHK (7), LLETECPQYIR (7) 


CPP 502 


18 


16 


ALNSIIDVYHK (8) 


CPP 502 


18 


17 


ALNSIIDVYHK (8-10), GADVWFK (9), GNFHAVYR (9X LLETECPQYIR (8, 10), MLTELEK 
(9) 


CPP 502 


18 


18 


ALNSIIDVYHK (6-7X GADVWFK (7), LLETECPQYIR (6-7) 


CPP 502 


18 


19 I ALNSIIDVYHK (7), LLETECPQYIR (7) 


CPP 502 


18 


20 


ALNSIIDVYHK (6), LLETECPQYIR (5) 


CPP 502 


18 


21 


LLETECPQYIR (6) 


CPP 502 


18 


22 


ALNSIIDVYHK (5), LLETECPQYIR (4) 
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Table 2 


CPP # 


CEX 


RP1 


Tryptic Sequences (RP2) 


CPP 502 


18 


23 


LLfc I fcCPQYIR (5) 


CPP 502 


18 


24 


ALNSJIDVYHK (3). LLETECPQYJR (3) 


CPP 502 


18 


25 


ALNSiiDVYhK (3J, LlEi £CPUY1K (3) 


CPP 502 


18 


26 


LLt IkUPUYlK (4) 


CPP 502 


18 


27 


ALNsUDVYHK (4), Li-fcrtCPQYIK (4) 


CPP 503 


16 


21 


HAlJ-GUPf-K (6) 


CPP 503 


17 


18 


CMGTVTLNQAR (7), FALLGDFFR (6-8), GSFDISCDK (7) 


CPP 503 


17 


19 


CMGTVTLNQAR (7). FALLGDFFR (6-7), IKDFLR (7) 


CPP 503 


17 


20 


FALLGDFFR (6), FALLGDFFRK (6) 


CPP 503 


17 


21 


FALLGDFFR (5-6), IKDFLR (6), TTQQSPEDCDFK (6) 


CPP 503 


17 


22 


CMGTVTLNQAR (4), FALLGDFFR (4, 6) 


CPP 503 


17 


23 


CMGTVTLNQAR (4). FALLGDFFR (4), QVLSYKEAVLR (4) 


CPP 503 


17 


24 


CMGTVTLNQAR (2-3), FALLGDFFR (3) 


CPP 503 


17 


25 


CMGTVTLNQAR (2), FALLGDFFR (3), FALLGDFFRK (2) 


CPP 503 


17 


28 


TTQQSPEDCDFKK (2) 


CPP 503 


17 


27 


CMGTVTLNQAR (4), FALLGDFFR (4), TTQQSPEDCDFKK (4) 


CPP 503 


17 


28 


FALLGDFFR (5) 


CPP 503 


17 


29 


CMGTVTLNQAR (4), FALLGDFFR (4). GSFDISCDK (4), TTQQSPEDCDFK <4), 
TTQQSPEDCDFKK (4) 


CPP 503 


17 


30 


FALLGDFFR (4) 


CPP 503 


18 ] 12 


A1DGINQR (10) 


-- OiM j 


I* 


1^ 


Vrrttn v r ljvUMK ^ /, r/-u_i.vji_/rrrt (cw r^LLOwr/ rv\ *,/ / 


CPP 503 


18 


20 


CMGTVTLNQAR (5-6), FALLGDFFR (5*6), FALLGDFFRK (5-6) 


CPP 503 


18 


21 


CMGTVTLNQAR (6), FALLGDFFR (5-7). IKDFLR (5) 


CPP 503 


18 


22 


CMGTVTLNQAR (4), FALLGDFFR (4-5), GSFOJSCDK (4), GSFDISCDKDNK (4), IKDFLR 
(4) 


CPP 503 


18 


23 


FALLGDFFR (4-5) 


CPP 503 


18 


24 


CMGTVTLNQAR (2), FALLGDFFR (2), FALLGDFFRK (2) 


CPP 503 


18 


25 


CMGTVTLNQAR (2), FALLGDFFR (2-3). FALLGDFFRK (2-3), GSFDISCDK (3) 


CPP 503 


18 


26 


CMGTVTLNQAR (4), FALLGDFFR (3-4), QVLSYKEAVLR (4) 


CPP 503 


18 


27 


CMGTVTLNQAR (4), FALLGDFFR (3-4), TTQQSPEDCDFK (4) 


CPP 503 


18 


28 


FALLGDFFR (3) 


CPP 503 


18 


29 


FALLGDFFR (3-4) 


CPP 503 


18 


30 


CMGTVTLNQAR (3-4). FALLGDFFR (3-4), FALLGDFFRK (3), TTQQSPEDCDFKK (4) 


CPP 504 


9 

• 


15 


VPLQQNFQDNQFQGK (15) 


CPP 504 


9 


16 


CDYWIR (11), ELTSELK (10), MYATIYELK (10-11), SLGLPENHIVFPVPIDQCIDG (10), 
SYPGLTSYLVR (11), TFVPGCQPGEFTLGN1K (10-11), VPLQQNFQDNQFQGK (10-11), 
WSTNYNQHAMVFFK (10), WYWGLAGNAILR (10) 


CPP 504 


9 


18 


VPLQQNFQDNQFQGK (8) 


CPP 505 


6 


8 


WEPPEKDDQLWLFPVQKPK (8) 


CPP 505 


6 


10 


AWMETEDTLGR (8) 


CPP 505 


7 


9 


WEPPEKDDQLWLFPVQKPK (12) 
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Table 2 


CPPg 


CEX 


RP1 


TryptJc Sequences (RP2) 


CPP505 


8 


10 


AWMETEDTLGR (8), WEPPEKDDQLWLFPVQKPK (8) 


CPPSOS 


8 


11 


AWMETEDTLGR (8-9), HWPSEQDPEKAWGAR (8), LLTTEEKPR (8), 
LWVMPNHQVLJLGPEEDQDHJYHPQ (8), WEPPEKDDQLWLFPVQKPK (8) 


CPP505 


9 


8 


LLTTEEKPR (11-13). WEPPEKDDQLWLFPVQKPK (12) 


CPP505 


9 


9 


AWMETEDTLGR (11. 13. 14. 15), GPILPGTK (13). HWPSEQDPEK (14). 
HWPSEQDPEKAWGAR (12), LLTTEEKPR (10. 13, 15. 17), 

A MM _[_■■» BWmvtM^AWmm** _-_P^_^___t a0BMx * _> __k M — •— ft _P_P^_ _ At m _A\ A A » 

WEPPEKDDQLWLFPVQKPK (11-15) 


CPP 505 


9 


10 


AWMETEDTLGR (9-11, 13. 14), DDQLWLFPVQKPK (9-10). 
LWVMPNHQVLLGPEEDQDHIYHPQ (10), WEPPEKDDQLWLFPVQKPK (11) 


CPPSOS ' 


9 


11 


AWMETEDTLGR (9-1 1.13), GPILPGTK (1 1-12), HWPSEQDPEKAWGAR (11), 
LLTTEEKPR (10-12), LLTTEEKPRGQGR (1 1 ), LWVMPNHQVLLGPEEDQDHIYHPQ (1 1 - j 
12), VLSPEPDHDSLYHPPPEEDQGEERPR (11), WEPPEKDDQLWLFPVQKPK (10-11) ' 


CPP 505 


10 


9 


AW Mt 1 tDTLGR (15-18), LLTTEEKPR (14-16), WEPPEKDDQLWLFPVQKPK (17) 


CPP 505 


A f\ 

10 


4 n 
10 


AWMETEDTLGR (8, 10, 11), HWPSEQDPEK (10), LLTTEEKPR (10), 
LWVMPNHQVlJLGPtEDQDHlYrtPQ (1 0-1 1 ) , W EPF E KD DQLWLF PVQKPK (9-11) 


CPP 505 


a n 

10 


4 4 
11 


AWM__TEDTL.GR (o), WcP PEKDDQLVVLF PVQKPK (11) 


PUD CAE 

CPP KID 


A A 

ll 




1 1 TTCC__>D t A A\ \fKfC ____tC_ l_T\rX~M Ct-irnt/D-' /<|C 4 — r. 

ULI VfcfcI\PK H4), VV__PPcl^uI_lLVVl__-PVU (1 &- 1 f) 


CPP 505 


44 
11 


4f\ 


AWMETEDTLGR (11). LWVMPNHQVLLGPEEDQDHIYHPQ (11), 
WEPPEKDDQLWLFPVQKPK (9. 11) 


/^oo cms 

CPP 505 


Q 

D 


in 


EVMPSJ QSLDALVK (5) 


CPP 505 


o 
O 




EVMP^QSLDALVK (5) 


« x « i V/ O -J 


-i 

■w 






^rvQ COO 

CPP 506 


o 
O 


ZD 


EVMPSJ QSLDALVK (3) 


CPP 507 


5 


7 


NANTFISPQQR (11-12) 


CPP 507 


0 


D 


NANTFISPQQR (8) 


CPP 507 


a 
0 


Q 

o 


NANTFISPQQR (7) 


jj CPP 507 


10 


7 


NANTFISP 




R (7-8, 11), YESHESMESYELNPFINRR (12) 


CPP 507 


10 


a 
o 


NANTFISPQQR (6) 


CPP 507 


<4 A\ 
11 


8 


NANTFISPQQR (8-9. 11, 12) 


/%ptp CAT 

CPP 507 


4 4 
11 


4 4 
1 1 


NANTFISPQQR (8) 


CPP 507 


12 


a 

8 


NANTFISPQQR (6-7. 12) 


CPP 507 


12 


9 


NANTFISPQQR (7-9) 


CPP 507 1 


13 


8 


NANTFISPQQR (8) 


CPP 507 


13 


9 


NANTFISPQQR (8-12) 


CPP 507 


14 


6 


NANTFISPQQR (10) 


CPP 507 


14 


7 ' 


NANTFISPQQR (10-14) 


CPP 507 


14 


9 


NANTFISPQQR (8) 


CPP 507 


14 


11 


NANTFISPQQR (7) 


CPP 507 


15 


7 


NANTFISP 




R(16) 


CPP 507 


16 


8 


NANTFISPQQR (7) 


CPPSOS 


4 


12 


GPETLCGAELVDALQFVCGDR (8) 


CPP 508 


5 


13 


GPETLCGAELVDALQFVCGDR (8-9) 
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Table 2 


CPP# 


CEX 


RP1 


Trypfac Sequences (RP2) 


POD CAO 

urr DUO 


i e 
5 


lo 


GPfc 1 lCXJAELVDALQFvCGDR (4) 


(JPP 50S 


0 


at 4 
11 


GPbTLCGAELVDALuFVCGDR (1Q) 


Orr Duo 


D 


jit 


GPfc 1 LCG At L VDALQFVC GDR (8) 


CPP 508 


6 


13 


GPETLCGAELVDALQFVCGDR (8, 12) 


C*PP 508 


T 

7 


10 


G Pfc FLCGAELVDALQFVCGDR (11) 


/♦Do eno 


7 


4 4 

1 1 


GFYFNKPTGYGSSSR (11), GPt 1 LCGAELVDALQFVCGOR (11-13), RAPQTG IVDECCFR 

/44 4 4V 

(lo-14) 


COD cnQ 


7 


40 


\sr yi-askp 1 GYG£>SSR (9), GPE r LL*aAfcLVUALQFVCGDR (8-9) 


Orr DUD 


0 
0 

i 




APU 1 Gl vDfcGGhK (6), GPE TU^aAEiLVDALQFVCGDR (7-8) 


CPP 508 


8 


13 


GrYhNKP I GYGSSSR (6-f ), GKb I LGGAELVDALQFVCGDR (6-8), RAPQTGIVDECCFR 
(7) 


CPP 508 


8 


14 


APQTGIVDECCFR (6). GFYFNKPTGYGSSSR (6), GPETLCGAELVDALQFVCGDR (6) 


CPP 508 


9 


11 


APQTGIVDECCFR (11), GPETLCGAELVDALQFVCGDR (10-11) 




9 


40 


APQTGIVDECCFR (1, 7. 6, 9), GFYFNKPTGYGSSSR (1, 7, 8), 
GPETLCGAELVDALQFVCGDR (1, 7, 8, 9, 10, 11, 12), RAPQTGIVDECCFR (7-8) 




9 


To 


APQTGlVDfcCOR (o-9), GpEI lCGAelVDALQFVCGDR (8-9). RAPQTGIVDECCFR (8) 


CPP 508 


9 


14 


GFYFNKPTGYGSSSR (7), GPETLCGAELVDALQFVCGDR (7). RAPQTGIVDECCFR (5-7) 




9 


15 


GFYFNKP I GYG55SR (7), RAPQTGIVDECCFR (7) 


UPP oUo 


9 


4 0 

ID 


GrYFNKP I GYGSS5K (5) 


Urr OUO 


9 


19 


RAPQTGIVDECCFR (3) 








APQTGIVDECCFR (10-12), GFYFNKPTGYGSSSR (8-10, 12), 


- 

CPP 508 


10 


12 


9, 12), RAPQTGIVDECCFR (8), RLEMYCAPLKPAK (7) 


CPP 508 


10 


13 


GFYFNKPTGYGSSSR (7), GPETLCGAE LVDALQ FVCGDR (7-8. 12) 


CPP 508 


11 


10 


GFYFNKPTGYGSSSR (1 1 ), GPETLCGAELVDALQFVCGDR (9-11) 


CPP 508 


11 


11 


APQTGIVDECCFR (10-12), GFYFNKPTGYGSSSR (9-11), GPETLCGAELVDALQFVCGDR 
(9-14), LEMYCAPLKPAK (11), RAPQTGIVDECCFR (9-12), RLEMYCAPLKPAK (10) 


CPP 508 


11 


12 


APQTGIVDECCFR (7-8), GPETLCGAELVDALQFVCGDR (7-8) 


CPP 508 


11 


13 


APQTGIVDECCFR (8), GPETLCGAELVDALQFVCGDR (8-9) 


CPP 508 


11 


19 


GPETLCGAELVDALQFVCGDR (3) 




19 


11 

1 1 


APQTGIVDECCFR (8-10). GFYFNKPTGYGSSSR (8-9), GPETLCGAELVDALQFVCGDR 
\o-iuj, ULMYOAPLKPAK (9), RAPQTGIVDECCFR (9), RLEMYCAPLKPAK (9) 


CPP 508 


12 


12 


APQTGIVDECCFR (7-8), GFYFNKPTGYGSSSR (7, 10), GPETLCGAELVDALQFVCGDR 
(7-13), RAPQTGIVDECCFR (7) 


CPP 508 


12 


13 


APQTGIVDECCFR (12), GFYFNKPTGYGSSSR (11-12), GPETLCGAELVDALQFVCGDR 
(8. 12) 


CPP 508 


12 


14 


RAPQTGIVDECCFR (6) 


CPP 508 


12 


19 


GPETLCGAELVDALQFVCGDR (3) 


CPP 508 


12 


20 


GPETLCGAELVDALQFVCGDR (1) 


CPP 508 


13 


11 


GFYFNKPTGYGSSSR (8), GPETLCGAELVDALQFVCGDR (9) 


CPP 508 


13 


12 


GPETLCGAELVDALQFVCGDR (8) 


CPP 508 


13 


13 

) 


GPETLCGAELVDALQFVCGDR (8-9) 
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TaWe 2 


CPP# 


CEX 


RP1 


Tryptic Sequences (RP2) 


Ann CAO 

CPP 505 


14 


10 


APQTGIVD ECCFR (9), GFYFNKPTGYGSSSR (9), GP^LClaAfcLVDALQFVCGDR (8-10), 
KAPQTG1 V DECCFR (8) 


CPP 5uo 


14 


n 


«PtTUCGAcLVUALQrVCi»L«< (3, 11 J 


CPP DUO 


14 


l£ 


Ajtxj 1 VjJVUfcCCf"K (o} r w*t 1 LCoAcLVIJALUrVCVsUK \'-Oj 


CPP oOo 


14 


1 O 


V3r*c 1 LUuAcLVUMLUrvuuUK (OJ 


POD CftO 

CPP DUo 


ID 


4 f\ 
IV 


ArUI GJVueCCrK (oJ r GPe J LCGAfcL VOALQr VCGurs \o- >U) 


Ur*r* DUO 


ID 


4 4 


aDrtTnx/ncoocD /a_o\ adcti pr*ci \/nAi nn/prriD ra_Q\ ? 
Mry 1 WVUtOWK (0-"/» GKfc 1 LUUntl. V UALUr VCVjUK |0-y) 


/»OD CAD 

•Wr DUO 


ID 


•to 


brTrWAribTwSwK 1* J, LH-'c t LCoAtu.VUAkwrVLrvyUK \f-<i, lU) 


add cna 

VrT'r' DUO 


ID 


l«3 


rvnen p/jACl \/rt ai <"%c\/j"VirYD #q\ 


\JrP DUO 


TO 


4 « 

1 1 


r*DCTi aaao \/TkAi /~*ei //"»/^r»D /n\ 
ort 1 LCldiAcLVUALWr VCGIJK ^9) 


ODD CfVQ 
\-rr*r* Duo 


JO 




ArU 1 til Vl/tUupK \o), GrYt^KPTeYGoSSH ID;, GPfcTLCoAcLVU/M_arvCGDR (0-8) 


CPP 508 


16 


13 


GPETLCGAELVDALQFVCGDR (7) 


Urr DUO 


1 f 


1 1 


ArU 1 tjl VUcUurK (o) f GrYrI>w\PTGYooooK \o), virfc: J LCGAfcLVUALwrvCGlJK (7-8) 


CPP 508 


17 


12 


APQTGIVDECCFR (6), GFYFNKPTGYGSSSR (6), GPETLCGAELVDALQFVCGDR (8-7) 


CPP DUO 


17 


4 O 

13 


APQTGlVDECCrR (6), GFYFNKPTGYGSSSR (6), GPETLCGAELvOALQFvCGDR (6-7) 


CPP DUO 


17 


15 


orci LCGAtLVUALQpvCGDR (5) 


CPP DUO 


17 




GPtT LC GAe L V LJALQ rVCG DK (1 / 


Odd cno 
V*TT^ DUO 


4 a 
JO 


44 

11 


rncn ofiACi \/nAi nrv/rr'no a<>\ 
v?r*C 1 LuuMcLVUnLUr VOoUn ^O/ 


CPP 508 


18 


12 


GPETLCGAELVDALQFVCGDR (6, 8) 


CPP 508 


18 


13 


GPETLCGAELVDALQFVCGDR (6-7) 


CPP 508 


18 


18 


GPETLCGAELVDALQFVCGDR (2) 


orr DUi; 




u 




CPP 509 


6 


8 


AQEPVKGPVSTKPGSCPilUR (7-8), CAMLNPPNR (7-8), CLKDTDCPGJK (7), 
VPFh4GQDPVK (7), VPFNGQDPVKGQVSVK (7) 


CPP 509 


8 


8 


AQEPVKGPVSTKPGSCPilUR (7), CAMLNPPNR (7) 


CPP 509 


9 


8 


VPFNGGDPVK (7) 


CPP 509 


10 


8 


GPVSTKPGSCPtUJR (8) 



Table 2 details, for each CPP, the sequences detected by mass spectrometry according to the 
procedures described in Example 1. hi addition, Table 2 indicates in which fractions of the CEX, 
RP1, and RP2 chromatographies each sequence was found. 

The CPPs listed in Table 2 were all identified as differentially expressed between individuals 
with cardiovascular disorders and control individuals using the procedure described in Example 1 . In 
particular, each CPP listed in Table 2 was found to vary between fee control and disease samples as 
detailed in Table 3 below. 
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Table 3 



Table 3 


CPP# 


Direction of variation 


CPP2 


Identified In Disease only 


CPP8 


Identified in Disease onJy 


CPP9 


Identified in Disease only 


CPP13 


Identified at a higher level in Controls 


CPP 14 


Identified at a higher level in Disease 


CPP17 


Identified in Disease only 


CPP 19 


Identified In Controls only 


CPP 20 


Identified In Disease only 


CPP 40 


Identified at a higher level in Controls 


CPP 41 


Identified at a higher level In Controls 


CPP 149 


Identified in Disease only 


CPP 150 


Identified at a higher level In Disease 


CPP 151 


Identified at a higher level In Disease 


CPP 501 


Identified at a higher level In Disease 


CPP 502 


Identified at a higher level in Controls 


CPP 503 


Identified at a higher level in Controls 


CPP 504 


Identified at a higher level in Controls 


CPP 505 


Identified at a higher teveJ in DJsease 


CPP 506 


Identified at a higher level in Disease 




< t • ■ ~ » «._■-.» • - * 


CPP 509 | 


Identified at a higher level in Disease 1 



10 



15 



One skilled in the art can use CPPs of the invention with a number of additional CPPs from 
Table 2, chosen using a suitable analysis of the levels of the CPPs from Table 2 measured in a number 
of diseased individuals and control individuals through the methods of Example 1. The strategies for 
discovering such combinations of CPPs need to regard each CPP as one variable and the disease as a 
joint, mulu-variate effect caused by interaction of these variables. 

Linear Discriminant Analysis (LDA) is one such analysis procedure, which can be used to 
detect significant association between a cluster of variables (i.e. CPPs) and cardiovascular diseases, hi 
performing LDA, a set of weights is associated with each variable (Le. CPP) so that the linear 
combination of weights and die measured values of the variables can identify the disease state by 
discriminating between subjects having a cardiovascular diease and subjects free from cardi vascular 
diseases. Enhancements to the LDA allow stepwise inclusion (or removal) of variables to optimize the 
discriminant power of the model. The results of the LDA is therefore a cluster of CPPs which can be 
used without limitations for diagnosis, prognosis, therapy or drug development. Other enhanced 
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versions of LDA, such as Flexible Discriminant Analysis permit the use of non-linear combinations of 
variables to discriminate a disease state from a normal state. The results of the discriminant analysis 
can be verified by post-hoc tests and also by repeating the analysis using alternative techniques such 
as classification trees. 

5 

Drug Screening Assays 

The invention provides a method (also referred to herein as a "screening assay") for 
identifying candidate modulators (e.g^ small molecules and peptides, antibodies, peptidomimetics or 
other drugs) which bind to CPPs, have a modulatory effect on, for example, CPP expression or 

10 preferably CPP biological activity, m some embodiments small molecules can be generated using 

combinatorial chemistry or can be obtained from a natural products library. Assays may be cell based 
or non-cell based assays. Drug screening assays may be binding assays or more preferentially 
functional assays, as further described. 

When the invention is used for drug development, e.g., to determine the ability of a CPP 

1 5 modulator or drug candidate to induce an anti -cardiovascular disorder response, the body fluid 

analyzed for the level of at least one CPP is preferably from a non-human mammal. The non-human 
mammal is preferably one in which the induction of an anti-cardiovascular disorder response by 

Rodents (mice, rats, etc.; and primates are particularly suitable tor use m tins aspect ol trie invention. 
20 Agents that are found, using screening assays as further described herein, to modulate CPP 

activity by at least 5%, more preferably by at least 10%, still more preferably by at least 30%, still 
more preferably by at least 50%, still more preferably by at least 70%, even more preferably by at 
least 90 %, may be selected for further testing as a prophylactic and/or therapeutic anti-cardiovascular 
disease agent 

25 m another aspect, agents that are found, using screening assays as further described herein, to 

modulate CPP expression by at least 5%, more preferably by at least 10%, still more preferably by at 
least 30%, still more preferably by at least 50%, still more preferably by at least 70%, even more 
preferably by at least 90 %, may be selected for further testing as a prophylactic and/or therapeutic 
anti-cardiovascular disease agent. 

30 Agents that are found to modulate CPP activity may be used, for example, to modulate 

treatment regimens for cardiovascular disorders or to reduce the symptoms of a cardiovascular 
disorder alone or in combination with other appropriate agents or treatments. 

Protein array methods are useful for screening and drug discovery. For example, one member 
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of a receptor/ ligand pair is docked to an adsorbent, and its ability to bind fee binding partner is 
determined in the presence of the test substance. Because of the rapidity with which adsorption can be 
tested, combinatorial libraries of test substances can be easily screened for their ability to modulate 
the interaction. In preferred screening methods, CPPs are docked to the adsorbent Binding partners 
are preferably labeled, thus enabling detection of the interaction. Alternatively, in certain 
embodiments, a test substance is docked to the adsorbent The polypeptides of the invention are 
exposed to the test substance and screened for binding. 

hi other embodiments, an assay is a cell-based assay in which a cell which expresses a CPP or 
biologically active portion thereof is contacted wife a test compound and the ability of the test 
compound to modulate CPP activity determined. Determining the ability of fee test compound to 
modulate CPP activity can be accomplished by monitoring the bioactivity of fee CPP or biologically 
active portion thereof. The cell, for example, can be of mammalian origin, insect origin, bacterial 
origin or a yeast cell. 

In one embodiment, the invention provides assays for screening candi dat e or test compounds 
which are target molecules of a CPP or biologically active portion thereof, hi another embodiment, 
fee invention provides assays for screening candidate or test compounds which bind to or modulate 
the activity of a CPP or biologically active portion thereof. The test compounds of fee present 

iaiownuuneart, mcttKnng; moiogicai noranes; spanany aaure^aoie paranei sonapnase or soiuaon 
phase libraries; synthetic library methods requiring deconvohrtion; fee "one-bead one-compound' 
library method; and synthetic library methods using affinity chromatography selection. The biological 
library approach is used with peptide libraries, while fee other four approaches are applicable to 
peptide, non-peptide oligomer or small molecule libraries of compounds (Lam, K. S. (1997) 
Anticancer Drug Des. 12:145, the disclosure of which is incorporated herein by reference in its 
entirety). 

Examples of methods for fee synthesis of molecular libraries can be found in the art, for example in: 
DeWitt et al. (1993) Proc. Natl. Acad. Sci. U.S A. 90:6909; Erb et al. (1994) Proc. Natl. Acad. Sci. 
USA 91:1 1422; Zuckermarmet ai. (1994). J. Med Chem. 37:2678; Cho et al. (1993) Science 
261:1303; Carrell et al. (1994) Angew. Chem. hit Ed. Engl. 33:2059 and 2061; and in Gallop et aL 
(1994) J. Med. Chem. 37: 1233, fee disclosures of which are incorporated herein by reference in their 
entireties. 

Libraries of compounds may be presented in solution (e.g., Houghten (1992) Biotechniques 
13:412-421), or on beads (Lam (1991) Nature 354:82-84), chips (Fodor (1993) Nature 364:555-556), 
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bacteria (Ladner U.S. Pat No. 5,223,409), spores (Ladner U.S. Pat No. '409), plasmids (Cull et al. 
(1992) Proc Natl Acad Sci USA 89:1865-1869) or on phage (Scott and Smith (1990) Science 
249:386-390); (Devin (1990) Science 249:404-406); (Cwirla et al. (1990) Proc. Natl. Acad. Sci. 
87:6378-6382); (Felici (1991) J. Mol. Biol. 222:301-310); (Ladner supra). 

Detennining die ability of the test compound to modulate CPP activity can also be 
accomplished, for example, by coupling the CPP or biologically active portion thereof with a label 
group such that binding of the CPP or biologically active portion thereof to its cognate target 
molecule can be determined by detecting the labeled CPP or biologically active portion thereof in a 
complex. For example, the extent of complex formation may be measured by immunoprecipitating 
the complex or by performing gel electrophoresis. 

It is also within the scope of this invention to determine the ability of a compound to interact 
with its cognate target molecule without the labeling of any of the interactants. For example, a 
micTophysiometer can be used to detect the interaction of a compound with its cognate target 
molecule without the labeling of either the compound or the target molecule. McComxell, H. M. et al. 
(1992) Science 257: 1906-1912, the disclosure of which is incorporated by reference in its entirety. A 
microphysiometer such as a cytosensor is an analytical instrument that measures the rate at which a 
cell acidifies its environment using a Light-Addressable Potentiometric Sensor (LAPS). Changes in 

hva preferred embodiment, the assay comprises: contacting a celt winch expresses a Cff or 
biologically active portion thereof with a target molecule to form an assay mixture, contacting the 
assay mixture with a test compound, and determining the ability of the test compound to modulate the 
activity of the CPP or biologically active portion thereof. Determining the ability of the test 
compound to modulate the activity of the CPP or biologically active portion thereof comprises: 
determining the ability of the test compound to modulate a biological activity of the CPP expressing 
cell (e.g., interaction with a CPP target molecule, as discussed above). 

hi another preferred embodiment, the assay comprises contacting a cell which is responsive to 
a CPP or biologically active portion thereof with a CPP or biologically active portion thereof, to form 
an assay mixture, contacting the assay mixture with a test compound, and detennining the ability of 
the test compound to modulate the activity of the CPP or biologically active portion thereof. 
Determining the ability of the test compound to modulate the activity of the CPP or biologically 
active portion thereof comprises determining the ability of the test compound to modulate a biological 
activity of the CPP-responsive cell. 

In another embodiment, an assay is a cell-based assay comprising contacting a cell expressing 
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a CPP target molecule (i.e. a molecule with which CPPs interact) with a test compound and 
determining the ability of the test compound to modulate the activity of the CPP target molecule. 
Determining the ability of the test compound to modulate die activity of a CPP target molecule can be 
accomplished, for example, by assessing the activity of a target molecule, or by assessing the ability 
of the CPP to bind to or interact with the CPP target molecule. 

Determining the ability of the CPP to bind to or interact with a CPP target molecule, for 
example, can be accomplished by one of the methods described above for directly or indirectly 
determining binding, hi a preferred embodiment, the assay includes contacting the CPP or 
biologically active portion thereof with a known compound which binds said CPP (e.g., a CPP 
antibody or target molecule) to form an assay mixture, contacting the CPP with a test compound 
before or after said known compound, and determining the ability of the test compound to interact 
with the CPP. Determining the ability of the test compound to interact with a CPP comprises 
determining the ability of the test compound to preferentially bind to the CPP or biologically active 
portion thereof as compared to the known compound DetOTnining the ability of the CPP to bind to a 
CPP target molecule can also be accomplished using a technology such as real-time Biomolecular 
Interaction Analysis (BIA). Sjolander, S. and Urbaniczky, C. (1991) Anal. Chem. 63:2338-2345 and 
Szabo et aL (1995) Curr. Opin. Struct. Biol. 5:699-705, the disclosures of which are incorporated 

i±iiGi<u,uvLia uiA^m uiLie, wimxHU tavcimg any o± uic imcravuuna ^c.g., x>iA.corcj. cnanges in cne 
optical phenomenon of surface plasmon resonance (SPR) can be used as an indication of real-time 
reactions between biological molecules. 

In another embodiment, the assay is a cell-free assay in which a CPP or biologically active 
portion thereof is contacted with a test compound and the ability of the test compound to modulate the 
activity of the CPP or biologically active portion thereof is determined. In a preferred embodiment, 
determining the ability of the CPP to modulate or interact with a CPP target molecule can be 
accomplished by <ietermming the activity of the target molecule. For example, the activity of the 
target molecule can be determined by contacting the target molecule with the CPP or a fragment 
thereof and measuring induction of a cellular second messenger of the target (e.g., cAMP, STAT3, 
Akt, intracellular Ca2+, diacylgrycerol, JP3, etc.), detecting catalytic/emzymatic activity of the target 
for an appropriate substrate, detecting the induction of a reporter gene (comprising a target-responsive 
regulatory element operativefy linked to a nucleic acid encoding a detectable marker, e.g., hiciferase), 
or detecting a target-regulated cellular response, for example, signal transduction or protein^rotem 
interactions. 
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The cell-free assays of the present invention are amenable to use of both soluble and/or 
membrane-bound forms of isolated proteins (e.g. CPPs or biologically active portions thereof or 
molecules to which CPPs targets bind), hi the case of cell-free assays in which a membrane-bound 
form an isolated protein is used it may be desirable to utilize a solubihzing agent such that the 



agents include non-ionic detergents such as n-octylghicoside, n-dodecylglucosidern- 
dodecytaaltoside, octanoyl-N-methylgJucamide, decanoyl-N-methylglucamide, Triton TM X-100, 
Triton TM X-l 14, Thesh TM, Isotridecypoly(ethylene glycol ether)n,3-I(3- 
cholamidopropyl)dimetiVlamminio]- 1 -propane sulfonate (CHAPS), 3-[(3- 

cholanridopropyl)dnnethylan^ sulfonate (CHAPSO), or N-dodecyl=N,N- 

dimethyl-3-ammonio-l -propane sulfonate. 



In more than one emt 
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lent of the above assay methods of the present invention, it may be 
desirable to immobilize either a CPP or its target molecule to facilitate separation of complexed from 
uncomplexed forms of one or both of the proteins, as well as to accommodate automation of the 
assay. Binding of a test compound to a CPP, or interaction of a CPP with a target molecule in the 
presence and absence of a candidate compound, can be accomplished in any vessel suitable for 
containing the reactants and by any immobilization protocol described herein. Alternatively, the 
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using standard techniques. 

Other techniques for immobilizing proteins on matrices can also be used in the screening 
assays of the invention. For example, either a CPP or a CPP target molecule can be immobihzed 
utilizing conjugation of bio tin and streptavidin. Biotinylated CPP or target molecules can be prepared 
from biotin-NHS (N -bydroxy-sucxrmimi de) using techniques well known in the art (e.g., biotinylation 
kit, Pierce Chemicals, Rockford, 111.), and immobilized in the wells of streptavidin-coated 96 well 
plates (Pierce Chemical). Alternatively, antibodies reactive with CPP or target molecules but which 
do not interfere with binding of the CPP to its target molecule can be derivatized to the wells of the 
plate, and unbound target or CPP trapped in die wells by antibody conjugation. Methods for detecting 
such complexes, in addition to those described above for the GST-immobilized complexes, include 
immunodetection of complexes using antibodies reactive with the CPP or target molecule, as well as 
enzyme-linked assays which rely on detecting an enzymatic activity associated with the CPP or target 
molecule. 
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cell is contacted with a candidate compound and the expression of CPP mRNA or protein in the cell is 
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determined. The level of expression of CPP mRNA or protein in fee presence of the candidate 
compound is compared to the level of expression of CPP mRNA or protein in the absence of the 
candidate compound The candidate compound can then be identified as a modulator of CPP 
expression based on this comparison. For example, when expression of CPP mRNA or protein is 
5 greater (statistically significantly greater) in the presence of the candidate compound than in its 
absence, the candidate compound is identified as a stimulator of CPP mRNA or protein expression. 
Alternatively, when expression of CPP mRNA or protein is less (statistically significantly less) in the 
presence of the candidate compound than in its absence, the candidate compound is identified as an 
inhibitor of CPP mRNA or protein expression. The level of CPP mRNA or protein expression in the 

10 cells can be determined by methods described herein for detecting CPP mRNA or protein. 

In yet another aspect of the invention, the CPP can be used as "bait proteins" in a two-hybrid 
assay or three-hybrid assay (see, e.g., U.S. Pat No. 5*283,317; Zervos et al. (1993) Cell 72:223-232; 
Madura et al. (1993) J. Biol. Chem. 268:12046-12054; Bartel et al. (1993) Biotechniques 14:920-924; 
Iwabuchi et aL (1993) Oncogene 8:1693-1696; and Brent WO94/10300, the disclosures of which are 

1 5 incorporated herein by reference in their entireties), to identify other proteins, which bind to or 
interact with CPPs ("CPP-binding proteins" or "CPP-bp") and are involved in CPP activity. Such 
CPP-binding proteins are also likely to be involved in the propagation of signals by the CPP or CPP 

i tte two-fcyond system is oased on tne modular nature 01 mosi transcription iactors, wmca 
20 consist of separable DNA-binding and activation domains. Briefly, the assay utilizes two different 
DNA constructs. In one construct, the gene that codes for a CPP or a fragment thereof is fused to a 
gene encoding the DNA binding domain of a known transcription factor (e.g., GAL-4). In the other 
construct, a DNA sequence, from a library of DNA sequences, that encodes an unidentified protein 
("prey" or "sample") is fused to a gene that codes for the activation domain of the known transcription 
25 factor. If the "bait" and the "prey" proteins are able to interact, in vivo, forming a CPP -dependent 
complex, the DNA-binding and activation domains of the transcription factor are brought into close 
proximity. This proximity allows transcription of a reporter gene (e.g., LacZ) which is operably linked 
to a transcriptional regulatory site responsive to the transcription factor. Expression of the reporter 
gene can be detected and cell colonies containing the functional transcription factor can be isolated 
30 and used to obtain the cloned gene which encodes the protein which interacts with the CPP. 

This invention further pertains to novel agents identified by the above-described screening 
assays and to processes for producing such agents by use of these assays. Accordingly, in one 
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iment, the present invention includes a compound or agent obtainable by a method comprising 
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the steps of any one of the aforementioned screening assays (e.g., cell-based assays or cell-free 
assays). 

Accordingly, it is within the scope of this invention to further use an agent identified as 
described herein in an appropriate animal model. For example, an agent identified as described herein 
5 (e.g., a CPP modulating agent, or a CPP -binding partner) can be used in an animal model to 

determine the efficacy, toxicity, or side effects of treatment with such an agent. Alternatively, an 
agent identified as described herein can be used in an animal model to determine the mechanism of 
action of such an agent. Furthermore, this invention pertains to uses of novel agents identified by the 
above-described screening assays for treatments as described herein. 

10 The present invention also pertains to uses of novel agents identified by the above-described 

screening assays for diagnoses, prognoses, prevention, and treatments as described herein. 
Accordingly, it is within the scope of the present invention to use such agents in the design, 
formulation, synthesis, manufacture, and/or production of a drug or pharmaceutical composition for 
use in diagnosis, prognosis, or treatment, as described herein. For example, in one embodiment, the 

1 5 present invention includes a method of synthesizing or producing a drug or pharmaceutical 

composition by reference to the structure and/or properties of a compound obtainable by one of the 
above-described screening assays. For example, a drug or pharmaceutical composition can be 



ceii wnicii expresses a i^fr target molecule is contacted wrai a xesc compound ana cne aoiuty ox tne 
20 test compound to bind to, or modulate the activity of, the CPP target molecule is determined. In 

another exemplary embodiment, the present invention includes a method of synthesizing or producing 
a drug or pharmaceutical composition based on the structure and/or properties of a compound 
obtainable by a method in which a CPP or biologically active portion thereof is contacted with a test 
compound and the ability of the test compound to bind to, or modulate the activity of the CPP or 
25 biologically active portion thereof is determined. 

Animal based drug screening 



30 cardiovascular disease. Animal-based model systems of cardiovascular disease include, but are not 
limited to, non-recombinant animals and transgenic animals. 

Non-recombinant animal models for cardiovascular disease may include, for example, genetic 
models. Such genetic cardiovascular disease models include apoB or apoR deficient pigs (Rapacz, et 




carried out in n 
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al„ 1986, Science 234:1573-1577) and Watanabe heritable hypolipidemic (WHHL) rabbits (Kita et 
aL, 1987, Proc. Nail. Acad. Sci U.S.A. 84: 5928-5931). Nan-recombinant, non-genetic animal models 
of atherosclerosis may include, for example, pig, rabbit, or rat models in which the animal has been 
exposed to either chemical wounding through dietary supplementation of LDL, or mechanical 
wounding through balloon catheter angioplasty, for example. 

As indicated in the art (Ferns, G. A. A. et al. (1991) Science, 253:1 129-1 1-32) the rat carotid 
artery injury model of restenosis can be a useful indication of potential therapeutic action. An 
example of this method is described in US Patent 6500859, the disclosure of which is incorporated 
herein by reference. Briefly, the protocol approved by the National Institute on Aging Animal Care 
and use Committee used 6 month Wistar rats from die GRC colony anesthetized with 20 mg/kg body 
weight pentobarbital, 2 mg/kg body weight ketamine, and 4 mg/kg body weight xylazine 
intraperitoneally. The left external carotid artery was carmulated with 2-French Fogarty embolectomy 
catheter, inflated with saline and passed three times up and down the common carotid artery to 
produce a distending, deendothelializing injury. The animals were treated with an appropriate dosage 
of die test substance or with vehicle alone (e.g., based on body weight per day in an appropriate 
solution such as 1:2:2:165 DMSO:Cremophor EL dehydrated ethanol .-phosphate buffered saline) by 
intraperitoneal injection beginning 2 hours after injury. Test substance or vehicle alone was 

(S treated and 10 vehicle-treated) were anesthetized as above ana the carotid artery was lsoiaieu ana 
fixed in 10% buffered formalin and embedded in paraffin. Cross sections of the carotids were 
mounted on microscope slides and stained with hematoxylin and eosin stain. The image of the carotid 
artery was projected onto a digitizing board and the cross sectional areas of die intima and the media 
were measured. Reduction of the neointimal area (thickening) indicates that the test substance is an 
effective antirestenosis agent 

Interfering with die recirculation of bile acids from the lumen of the intestinal tract is found to 
reduce the levels of serum cholesterol in a causal relationship. Epidemiological data has accumulated 
which indicates such reduction leads to an improvement in the disease state of atherosclerosis 
(Stedronsky, Biochirrrica et Biophysica Acta, 1210, 255-287 (1994)). Inhibition of cholesteryl ester 
transfer protein (CETP) has been shown to effectively modify plasma HDL/LDL ratios, and is 
expected to check the progress and/or formation of certain cardiovascular diseases, inhibition of 
CETP should lead to elevation of plasma HDL cholesterol and lowering of plasma LDL cholesterol, 
thereby providing a therapeutically beneficial plasma lipid profile (McCarthy, Medicinal Res. Revs., 
13, 139-59 (1993)). An in vivo assay for compounds that inhibit rat ileal uptake of 14 OTaurocholate 
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into bile (CETP inhibition) is disclosed in US Patent 6489366 and Une, et al. Biochimica et 
Biophysica Acta, 833, 196-202 (1985), disclosures of which are incorporated herein by reference. 

Briefly, male Wistar rats (200-300 g) are anesthetized with inactin (100 mg/kg). Bile ducts are 
cannulated with a lOinch length of PE10 tubing. The small intestine is to be exposed and laid out on a 
gauze pad, A canulae (1/8" luer lock, tapered female adapter) is inserted at 12 cm from the junction of 
the small intestine and the cecum. A slit is cut at 4 cm from this same junction (utilizing a 8 cm length 
of ileum). Twenty milliliters of warm Dulbecco's phosphate buffered saline, pH 6.5 (PBS) is to be 
used to flush out the intestine segment The distal opening is cannulated with a 20 cm length of 
silicone tubing (0.02° LD. times 0.037" OJD.). The proximal cannulae is hooked up to a peristaltic 
pump and the intestine is washed for 20 min with warm PBS at 0.25 ml/min. Temperature of the gut 
segment is to be monitored continuously. At the start of the experiment, 2.0 ml of control sample ( l4 C- 
taurocholate at 0.05 mCi/mL with 5 mM nori-radiolabeled taurocholate) is loaded into the gut segment 
with a 3 ml syringe and bfle sample collection is begun. Control sample is infused at a rate of 0.25 
ml/min for 21 min. Bile samples fractions are to be collected every 3 minute for the first 27 minutes 
of the procedure. After the 21 min of sample infusion, the ileal loop is to be washed out with 20 ml of 
warm PBS (using a 30 ml syringe), and then the loop is to be washed out for 21 min with warm PBS 
at 0.25 ml/min. A second perfusion is initiated as described above but this with test compound being 



S max tor the rim z / nun, li neeessary, a ttura per rusion is pertormea as aoove mat typically contains 
the control sample. 

Li addition, measurement of hepatic cholesterol concentration is a useful assay for 
determining the effectiveness of a test substance against cardiovascular disorders. In this assay, liver 
tissue is weighed and homogenized in chloroformrmethanol (2: 1). After homogenization and 
centrifugation the supernatant is separated and dried under nitrogen. The residue is to be dissolved in 
isopropanol and the cholesterol content measured enzymatically, using a combination of cholesterol 
oxidase and peroxidase, as described by Allain, C. A. et al., Clin. Chem., 20, 470 (1974) (herein 
incorporated by reference). 

Similarly, serum cholesterol may be determined as follows. Total serum cholesterol is 
measured enzymatically using a commercial kit from Wako Fine Chemicals (Richmond, Va.); 
Cholesterol CI 1, Catalog No. 276-64909. HDL cholesterol may be assayed using this same kit after 
precipitation of VLDL and LDL with Sigma Chemical Co. HDL Cholesterol reagent, Catalog No. 
352-3 (dextran sulfate method). Total serum triglycerides (blanked) (TGI) is also assayed 
enzymatically with Sigma Chemical Co. GPO-Trinder, Catalog No. 337-B. VLDL and LDL 
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(VLDL+LDL) cholesterol concentrations are calculated as the difference between total and HDL 
cholesterol. A reduction in VLDL+LDL cholesterol in the test substance-treated sample relative to 
control is indicative of an effective anti-cardiovascular disorder agent 

A dog model for evaluating lipid lowering drugs may also be utilized, for example, as 
described in US Patent 6489366. 

Briefly, male beagle dogs, obtained from a vendor such as Marshall farms and weighing 6-12 
kg are fed once a day for two hours and given water ad libitum. Dogs may be randomly assigned to a 
dosing groups consisting of 6 to 12 dogs each, such as: vehicle, i.g.; 1 mg/kg, i.g.; 2 mg/kg, i.g.; 4 
mg/kg, i.g.; 2 mg/kg, p.o. (powder in capsule). Intra-gastric dosing of a therapeutic material dissolved 
in aqueous solution (for example, 0.2% Tween 80 solution [polyoxyethylene mono-oleate, Sigma 
Chemical Co., St Louis, Mo.]) may be done using a gavage tube. Prior to initiating dosing, blood 
samples may be drawn from the cephalic vein in the morning before feeding in order to evaluate 
serum cholesterol (total and HDL) and triglycerides* For several consecutive days animals are dosed 
in the morning, prior to feeding. Animals are to be allowed 2 hours to eat before any remaining food 
is removed. Feces are to be collected over a 2 day period at the end of the study and may be analyzed 
for bile acid or lipid content Blood samples are also to be taken, at the end of the treatment period, 
forcoi nparison with pre-study serum lipid levels. Statistical significance will be determined using the 

Serum lipid measurement is measured similarly. Blood is collected from the cephalic vein of 
fasted dogs in serum separator tubes (Vacutainer SST, Bee ton Dickinson and Co., Franklin Lakes, 
N J.). The blood is centrifuged at 2000 rpm for 20 minutes and the serum decanted. Total cholesterol 
may be measured in a 96 well format using a Wako enzymatic diagnostic kit {Cholesterol CD) (Wako 
Chemicals, Richmond, Va.), utilizing the cholesterol oxidase reaction to produce hydrogen peroxide 
which is measured colorimetrically. A standard curve from 0.5 to 10 ug cholesterol is to be prepared 
in the first 2 columns of the plate. The serum samples (20-40 ul, depending on the expected lipid 
concentration) or known serum control samples are added to separate wells in duplicate. Water is 
added to bring the volume to 100 ul in each well. A 100 ul aliquot of color reagent is added to each 
well and the plates will be read at 500 inn after a 15 minute incubation at 37 degrees centigrade. 

HDL cholesterol may be assayed using Sigma kit No. 352-3 (Sigma Chemical Co., St Louis, 
Mo.) which utilizes dextran sulfate and Mg ions to selectively precipitate LDL and VLDL. A volume 
of 150 ul of each serum sample is to be added to individual microfuge tubes, followed by 15 ul of 
HDL cholesterol reagent (Sigma 352-3). Samples are to be mixed and centrifuged at 5000 rpm for 5 
minutes. A 50 ul aliquot of the supernatant is to be then mixed with 200 ul of saline and assayed using 
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the same procedure as for total cholesterol measurement 

Triglycerides are measured using Sigma kit No. 337 in a 96 well plate format This procedure 
will measure glycerol, following its release by reaction of triglycerides with lipoprotein lipase. 
Standard solutions of glycerol (Sigma 339-1 1) ranging from 1 to 24 ug are to be used to generate the 
5 standard curve* Serum samples (20-40 ul, depending on the expected lipid concentration) are added to 
wells in duplicate. Water is added to bring the volume to 100 ul in each well and *Q0 ul of color 
reagent is also added to each well. After mixing and a 1 5 minute incubation, the plates will be read at 
540 ran and the triglyceride values calculated from the standard curve. A replicate plate is also to be 
run using a blank enzyme reagent to correct for any endogenous glycerol in the serum samples. 

10 Test compounds may be evaluated for their effect on serum glucose and serum insulin in 

db/db mice (C578BL/KsJ-dh/db Jcl) as described in US 6462046, disclosure of which is incorporated 
herein. The compounds are dissolved in a vehicle (e.g,, consisting of 2% Tween80 in distilled water) 
and administered orally. Dosage is determined by body weight All aspects of the work including 
e x perimentation and disposal of the animal a is performed in general accordance with the International 

15 Guiding Principles for Biomedical Research Involving Animals (CIOMS Publication No. ISBN 92 

90360194, 1985). Glucose-HA Assay kits (Wako, Japan) are used for determination of serum glucose 
and ELISA Mouse Insulin Assay kits (SPI bio, France) are utilized for determination of insulin. An 

The ajiirnals are divided dnto twenty groups of. lour animals eacfi. ifie animais weign t/- j 
20 gms at age 8-10 weeks. During the experiment the animals are provided free access to laboratory 

chow (Fwusow Industry Co., Taiwan) and water. Prior to any treatment a blood sample (pretreatment 
blood) is taken from each animal. Four groups of animals, the vehicle groups, receive only doses of 
the vehicle. Each of the vehicle groups receive 100, 30, 10 or 1 ml/kg body weight of the vehicle 
orally. A trightazone solution (10 ml/kg body weight in tween 80/water) is administered orally to the 
25 four positive control groups in doses of 100, 30, 10 and 1 ml/kg body weight respectively. The test 
compound is similarly administered orally as a solution to four groups of animals with each group 
receiving a different dose of the compound. The vehicle, positive control and test compound solutions 
are administered to the groups immediately, 24 hours and 48 hours after drawing the pretreatment 
blood. Blood is withdrawn (post treatment blood) 1 .5 hours after administration of the last dose. The 
30 serum glucose are determined enzymatically (Mutaratose-GOD) and the insulin levels by ELISA 
(mouse insulin assay kit). The mean SEM of each group is calculated and the percent inhibition of 
serum glucose and insulin obtained by comparison between pretreatment blood and post treatment 
blood. The percentage of reduction of the serum glucose and insulin levels in the post treatment blood 
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relative to die pretreatment blood is determined and the Unpaired students t test applied for the 
comparison between the control and test solution groups and the vehicle group. A significant 
difference is considered at P<0.05. TrogHtazone, as an effective anti-cardiovascular disorder agent, 
results in a reduced glucose level at 10 mg/kg body weight (25 +/- 2%). 

US Patent 6121319, disclosure of which is incorporated herein, describes an assay for the 
progression of atherosclerosis in hypcrchol es terol emic rabbits. The rabbits are sacrificed and aortas 
obtained. The aortas are stained with sudan-4 and the extent of staining analyzed. The percent aortic 
surface area covered by lesions in test substance treated and untreated lipid-fed rabbits is graphed. 
The aortas of the rabbits treated with an effective anti-atherosclerotic agent have less staining, 
indicating decreased atherosclerosis. In addition, sections of die aortas are imnrunostained for 
VCAM-1 expression or macrophage accumulation using antibodies for VCAM-1 or Ram-1 1 antigen. 
Reduced VCAM-1 expression and macrophage accumulation compared to control treated samples are 
indicative of an effective agent 



15 



Reduction in DDL cholesterol may also be determined in a primate model. For example, 
Cynomolgus monkeys are made hypercholesterolemic prior to test compound dosing by feeding a 
high fat cholesterol diet The monkeys are then dosed orally with the test compound or control vehicle 
for two weeks. A reduction in die percentage serum LDL cholesterol in the monkeys over this time 



20 Pharmaceutical Compositions 

When polypeptides of the present invention are expressed in soluble form, for example as a 
secreted product of transformed yeast or mammalian cells, they can be purified according to standard 
procedures of the art, including steps of ammonium sulfate precipitation, ion exchange 
chromatography, gel filtration, electrophoresis, affinity chromatography, according to, e.g., "Enzyme 

25 Purification and Related Techniques," Methods in Enzymology, 22:233-577 (1 977), and Scopes, R., 
Protein Purification: Principles and Practice (Springer-Verlag, New York, 1982) provide guidance in 
such purifications. Likewise, when polypeptides of the invention are expressed in insoluble form, for 
example as aggregates or inclusion bodies, they can be purified by appropriate techniques, including 
separating the inclusion bodies from disrupted host cells by centrifugation, solubilizing the inclusion 

30 bodies with chaotropic and reducing agents, diluting the sohibilized mixture, and lowering the 

concentration of chaotropic agent and reducing agent so that the polypeptide takes on a biologically 
active conformation. The latter procedures are disclosed in the following references, which are 
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incorporated by reference: Winkler et al, Biochemistry, 25: 4041-4045 (1986); Winkler et al, 
Biotechnology, 3: 992-998 (1985); Koths et al, U.S. patent 4,569,790; and European patent 
applications 86306917.5 and 86306353.3. 

Compounds capable of detecting or modulating a CPP or a CPP biological activity, including 
small molecules, peptides, CPP nucleic acid molecules, and anti- CPP antibodies of the invention, can 
be incorporated into pharmaceutical compositions suitable for administration. Such compositions 
typically comprise a pharmaceutical ly acceptable carrier. As used herein the language 
"pharmaceuticalry acceptable carrier" is intended to include any and all solvents, dispersion media, 
coatings, antibacterial and antifungal agents, isotonic and absorption delaying agents, and the like, 
compatible with pharmaceutical administration. The use of such media and agents for 
pharmaceutical^ active substances is well known in the art Except insofar as any conventional media 
or agent is incompatible with the active compound, use thereof in the compositions is contemplated. 
Supplementary active compounds can also be incorporated into the compositions. 

A pharmaceutical composition of the invention is formulated to be compatible with its 
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intravenous, intradermal, subcutaneous, oral (e.g., inhalation), transdermal (topical), transmucosal, 
and rectal administration. Solutions or suspensions used for parenteral, intradermal, or subcutaneous 



solution, nxeaoils, poiyettryiene grycois, giycenne, propylene giycoi or ocner syntneuc solvents; 
antibacterial agents such as benzyl alcohol or methyl parabens; antioxidants such as ascorbic acid or 
sodium bisulfite; chelating agents such as ethylenediaminetetraacetic acid; buffers such as acetates, 
citrates or phosphates and agents for the adjustment of tonicity such as sodium chloride or dextrose. 
pH can be adjusted with acids or bases, such as hydrochloric acid or sodium hydroxide. The 
parenteral preparation can be enclosed in ampoules, disposable syringes or multiple dose vials made 
of glass or plastic. 

Pharmaceutical compositions suitable for injectable use include sterile aqueous solutions 
(where water soluble) or dispersions and sterile powders for the extemporaneous preparation of sterile 
injectable solutions or dispersion. For intravenous administration, suitable carriers include 
physiological saline, bacteriostatic water, Cremophor EL® (B ASF, Parsippany, N J.) or phosphate 
buffered saline (PBS). In all cases, the composition must be sterile and should be fluid to the extent 
that easy syringability exists. It must be stable under the conditions of manufacture and storage and 
must be preserved against the contaminating action of microorganisms such as bacteria and fungi. The 
carrier can be a solvent or dispersion medium containing, for example, water, ethanoL, polyol (for 
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example, glycerol, propylene glycol, and liquid polyethylene glycol, and the like), and suitable 
mixtures thereof. The proper fluidity can be maintained, for example, by the use of a coating such as 
lecithin, by the maintenance of the required particle size in the case of dispersion and by the use of 
surfactants. Prevention of the action microorganisms can be achieved by various antibacterial and 
antifungal agents, for example, parabens, chlorobutanol, phenol, ascorbic acid, thimerosal, and the 
like, hi marry cases, it will be preferable to include isotonic agents, for example, sugars, polyalcohols 
such as manitol, sorbitol, sodium chloride in the composition. Prolonged absorption of the injectable 
compositions can be brought about by including in the composition an agent which delays absorption, 
for example, aluminum monostearate and gelatin. 

Where the active compound is a protein, e.g., an anti-CPP antibody, sterile injectable 
solutions can be prepared by incorporating the active compound in the required amount in an 
appropriate solvent with one or a combination of ingredients enumerated above, as required, followed 
by filtered sterilization. Generally, dispersions are prepared by incorporating the active compound 
into a sterile vehicle which contains a basic dispersion medium and other required ingredients from 
those enumerated above, in the case of sterile powders for the preparation of sterile injectable 
solutions, the preferred methods of preparation are vacuum drying and freeze-drying which yields a 
powder of the active ingredient phis any additional desired ingredient from a previously Sterile- 
filtered solution thereof 

^auj. wiyni|/ yoiuuiid ^wiv«ti*^ an mcrt uuucm ui cm Calais earner, iiicy can oe 

enclosed in gelatin capsules or compressed into tablets. For the purpose of oral therapeutic 
administration, the active compound can be incorporated with excipients and used in the form of 
tablets, troches, or capsules. For administration by inhalation, the compounds are delivered in the 
form of an aerosol spray from pressured container or dispenser which contains a suitable propellant, 
e.g., a gas such as carbon dioxide, or a nebulizer. Systemic administration can also be by transrnucosal 
or transdermal means. For transrnucosal or transdermal administration, penetrants appropriate to the 
barrier to be permeated are used in the formulation. Such penetrants are generally known in the art, 
and include, for example, for transrnucosal administration, detergents, bile salts, and fusidic acid 
derivatives. Transrnucosal administration can be accomplished through the use of nasal sprays or 
suppositories. For transdermal administration, the active compounds are formulated into ointments, 
salves, gels, or creams as generally known in the art Most preferably, active compound is delivered to 
a subject by intravenous injection. 

hi one embodiment, die active compounds are prepared with carriers that will protect the 
compound against rapid elimination from the body, such as a controlled release formulation, including 
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implants and microencapsulated delivery systems. Biodegradable, biocompatible polymers can be 
used, such as ethylene vinyl acetate, polyanhydrides, polyglycolic acid, collagen, polyorthoesters, and 
potyiactic acid. Methods for preparation of such formulations will be apparent to those skilled in the 
art. Hie materials can also be obtained commercially from Alza Corporation and Nova 
Pharmaceuticals, Inc. Liposomal suspensions (including liposomes targeted to infected cells with 
monoclonal antibodies to viral antigens) can also be used as pharmac euti cally acceptable carriers. 
These can be p r e p ar ed according to methods known to those skilled in the art, for example, as 
described in U.S. Pat. No. 4,522,81 1 , the disclosure of which is incorporated herein by reference in its 
entirety. 

m a further embodiment, the active compound may be coated on a microchip drug delivery 
device. Such devices are useful for controlled delivery of proteinaceous compositions into the 
bloodstream, cerebrospinal fluid, lymph, or tissue of an individual without subjecting such 
compositions to digestion or subjecting the individual to injection. Methods of using microchip drug 
delivery devices are described in US Patents 6123861, 5797898 and US Patent application 
200201 19 176A1 , disclosures of which are hereby incorporated in their entireties. 

It is especially advantageous to formulate oral or preferably parenteral compositions in dosage 
unit form for ease of administration and uniformity of dosage. Dosage unit form as used herein refers 

a predetermined quantity of acttvc eom potm a caicuiatea a> pxuuucc uic ucsucu uicia^eutuj oucwv m 
association with the required pharmaceutical carrier. The specification for the dosage unit forms of 
the invention are dictated by and directly dependent on the unique characteristics of the active 
compound and the particular therapeutic effect to be achieved, and the limitations inherent in the art 
•of compounding such an active compound for the treatment of individuals. 

Toxicity and therapeutic efficacy of such compounds can be determined by standard 
pharmaceutical procedures in cell cultures or experimental animals, e.g., for detennining the LD50 
(the dose lethal to 50% of the population) and the ED50 (the dose therapeutically effective in 50% of 
the population). The dose ratio between toxic and therapeutic effects is the therapeutic index and it 
can be expressed as the ratio LD50/ED50. Compounds which exhibit large therapeutic indices are 
preferred. While compounds that exhibit toxic side effects may be used, care should be taken to 
design a delivery system that targets such compounds to the site of affected tissue in order to 
minimize potential damage to uninfected cells and, thereby, reduce side effects. 

The data obtained from the cell culture assays and animal studies can be used in formulating a 
range of dosage for use in humans. The dosage of such compounds lies preferably within a range of 
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circulating concentrations that include fee ED50 with little or no toxicity. The dosage may vary within 
this range depending upon the dosage form employed and the route of administration utilized. For any 
compound used in the method of the invention, fee therapeutically effective dose can be estimated 
initially from cell culture assays. A dose may be formulated in animal models to achieve a circulating 
5 used to more accurately determine useful doses in humans. Levels in plasma may be measured, for 
example, by high performance liquid chromatography. 

The pharmaceutical compositions can be included in a container, pack, or dispenser together 
wife instructions for administration. 

1 0 Cardiovascular~disorder therapy 

The CPP modulators and anti-CPP antibodies of the invention can be used in the treatment or 
prevention of CPP-related disorders. Thus, in one aspect fee invention relates to pharmaceutical 
compositions containing an antibody, antibody fragment, or peptide modulator of CPP, preferably 
containing a pharmaceutically acceptable carrier or diluent. The carrier or diluent is preferably 

15 adapted for oral, intravenous, intramuscular or subcutaneous administration. Pharmaceutical 

compositions may comprise or consist essentially of any of fee CPP modulators, anti-CPP antibodies, 
or anti-CPP antibody fragments described herein. 

Such agents may fce used advantageously in combination with a CPP-reiated composition. 

20 For example, cell cycle inhibitors and proto-oncogenes (Simari and Nabel, Semin. Intervent. 

Cardiol. 1:77-83 (1996)); NO (nitric oxide) donor drugs; pro-apoptotic agents such as bcl-x (Pollman 
et aL, Nature Med. 2:222-227 (1998)); herpes virus thymidine kinase (tk) gene and systemic 
ganciclovir (Ohno et a!., Science 265:781-784 (1994); Guzman et aL, Proc. Natl. Acad. Sci. USA 
91:10732-10736 (1994); Chang et ai., MoL Med. 1:172-181 (1995); and Simari et aL, Circulation 

25 92:1-501 (1995)) have been exploited to treat atherosclerosis, restenosis and neointimal smooth 

muscle proliferation. Disclosures of the above references are hereby incorporated in their entireties. 

Anti-thrombotic agents useful in combination wife fee compositions of the invention include, 
for example, inhibitors of the Ilb/HTa integrin; tissue factor inhibitors; and anti-thrombin agents. An 
antiarrhythmic agent, such as a local anesthetic (class I agent), sympathetic antagonist (class II agent), 

30 antifibrillatory agent (class HI agent) calcium channel agent (class IV agent) or anion antagonist (class 
V agent) as described in Vukmir, Am L Emer, Med. 13:459-470 (1995); Grant, PACE 20:432-444 
(1997); Assmann L, Curr. Med. Res. Opm. 13:325-343 (1995); and Lipka et al., Am. Heart J. 
130:632-640 (1995), disclosures of which are hereby incorporated by reference in their entireties, may 
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also be used. Examples of class I agents include: procainamide; quinidine or disopyramide; lidocaine; 
phenytoin; tocainide or mexiletine; encainide; flecainide; lorcainide; propafenone (ID) or moricizine. 
Sympathetic antagonists include: propranolol, esmolol, metoprolol, atenelal, or acebutolol. Examples 
of antifibrillatory agents are bretylium, amiodarone, sotalol (II) or N-ac etylprocainamide . Class IV 
5 agents include verapamil, dfltiazem, and bepridil, and anion antagonists such as alinidine. 

Congestive heart failure therapeutic agents include TNF inhibitors such as Embrel.TM. 
(Tmmunex Corp.; Seattle, Wash.), TBC1 125 1, or an ACE (angiotensin converting enzyme) inhibitor, 
such as Natrecor (nesiritide; Scios, Inc.). Angiogenic agents, for example, recombinant VEGF 
isoforms, such as rhVEGF developed by Genentech; a nucleic acid molecule encoding the 121 amino 

10 acid isoform of VEGF (BioByPass.TM.; GenVec/Parke Davis); or a nucleic acid encoding VEGF-2 
(Vascular Genetics, Inc.); F1BLAST.TM., a recombinant form of FGF-2 being developed by Scios, 
Inc. (Mountain View, Calif.) and Wyetfa Ayerst Laboratories (Radnor, Pa.), GENERX.TM., or an 
adenoviral gene therapy vector encoding FGF-4 developed by Collateral Therapeutics (San Diego, 
Calif) and Schering AG (see Miller and Abrams, Gen. Engin. News 1 8:1 (1998), disclosure of which 

15 is hereby incorporated by reference in its entirety), are also useful in combination with the CPP- 
related compositions of the invention. Finally, calcium antagonists, such as amlodipine (Marche et 
al., bit J. Cardiol. 62 (Suppl.):S17-S22 (1997); Schachter, Int. J. Cardiol. 62 (Suppl.):S85-S90 

(Ed.) Calcium Antagonists pages 157-260 London: Academic Press (1988); Schachter, hit J. Cardiol. 
20 62 (Suppl.):S9-S 15 (1997)) are also advantageous therapeutic agents for cardiovascular disorders. 

Having generally described this invention, a further understanding can be obtained by 
reference to certain specific examples which are provided herein for purposes of illustration only, and 
are not intended to be limiting unless otherwise specified. 

25 

EXAMPLES 

Example I: Characterization of CPP levels in disease and control populations 

Subjects enrolled in the Duke Databank for Cardiovascular Disease were selected on the basis 
30 of coronary artery disease (CAD). A total of 24 1 CAD patients and control individuals were further 
matched for gender, age, and ethnicity and individuals with plasma abnormalities were excluded. A 
set of 53 CAD patients and a set of 53 control individuals were established. Six liters of plasma were 
pooled from each set An aliquot of plasma was retained from each individual, thus allowing a 
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positive result in the pooled sample to be confirmed for each member of the population. Such 
confirmation is valuable to erase possible confounding effects of an individual with an aberrant level 
of a specific polypeptide that is not related to a cardiovascular disorder. Two and a half liters of 
pooled plasma from each population were subjected to separation by multiple chromatography steps 
according to the MicroprotTM process as follows: 
Step 1 : HSA/IgG depletion 

125 ml frozen plasma were defrost and filtered on 0.45 jim sterile filter in a sterile hood. 

Filtrate was injected on two inline columns of respectively 300 ml of HSA ligand Sepharose 
fast Flow column (Amersham, Upsala, Sweden), 5cm ID, 1 5 cm length; and 100 ml Protein G 
Sepharose fast Flow column (Amersham, Upsala, Sweden), 5 cm ID, 5 cm length. 

Columns were equilibrated and washed with 50 mM PCM buffer, pH 7.1, 0. 15M NaCl. Flow 
rate was 5 ml/mm. 

Non-retained fraction (350 ml) was frozen until second step. Twenty runs were performed. 
Step 2: Gel Filtration / Reverse Phase Capture step 

Sample from step 1 was defrosted and filtered on 0.45 urn sterile filter in a sterile hood. 

Filtrate was injected on two in line gel filtration columns: 2X9.5 litres Superdex 75 
(Amersham, UK) column, 14 cm ID, 62 cm length. Column was equilibrated with 50mM PQ4 buffer 

150 ml PLRPS fPotymer Labs, UK). Precolumn was switched for sample injection. Gel filtration was 
performed at a flow rate of 40 ml/min. 

Low molecular weight proteins (<20 kDa) were oriented to in line reverse phase capture 
column: 50 ml PLRPS 100 angstroms (Polymer labs, UK). The three-way valve controlling injection 
on PLRPS column was switched at a cut-off of 33 mAU (280 nro) to send gel filtration eluate into 
reverse phase capture column. This cut-off value was established by first using SDS-PAGE to 
provide an estimated range of OD values and by subsequently evaluating three cut-off values (high, 
median and low values of OD range). The final cut-off value was chosen to maximize the low 
molecular weight protein obtained, with a low molecular protein proportion of at least 85%. Low 
molecular weight proteins and peptides were ehrted from reverse phase capture PLRPS column by one 
column volume gradient of 0.1% TFA, 80% CH3CN in water. 

Eluate fractions (50 ml) were frozen until next step. Twenty runs were performed. At the end of this 
step, all reverse phase eluates were defrosted, pooled (1 liter) and shared in 7 polypropylene 
containers (143 ml). Containers were kept at — 20°C until use for next step. 
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Step 3: Cation Exchange 

Sample from step 2 (147 ml) was defrosted and mixed with an equal volume of cation 
exchange buffer A (Gly/HCl buffer 50 mM, pH 2.7, urea 8M). 

Sample was injected on a 100 ml Source 15S column (Amersham, Upsala, Sweden), 35 mm 
ID, 100 mm length. Column was equilibrated and washed with buffer A. Flow rate was 10 ml/min. 

Proteins and peptides were eluted with step gradient from 1 00% buffer Ajuntil 100 % buffer 
B (buffer A containing 1M NaCl): 

3 column volumes 7.5% B (75 mM NaCl) 

3 column volumes 10% B (100 mM NaCl) 

3 column volumes 17.5% B (175 mM NaCl) 

2 column volumes 22.5% B (225 mM NaCl) 

2 column volumes 27.5% B (275 mM NaCl) 

2 column volumes 100% B (1 M NaCl) 

45 to 60 fractions were collected based on peak. Seven runs were conducted. After 7 runs 
were achieved, fractions were pooled intra and inter run in order to obtain 1 8 fractions. Fractions were 
kept at -20°C until use for next step. 

Step 4: Reduction/Alkylation and Reverse Phase HPLC Fractionation 1 

iracuons was reaucea wun aumoerymnfok itsit*,, otrmm y 5 nours at 5r<j) and aikyiated with 
iodoacetamid (120 mM, 1 hour 25°C in the dark). The latter reaction was stopped with the addition of 
DTE (30 mM) followed by acidification (TFA, 0.1 %). The fractions were then injected on an 
Uptispher C8, 5 urn, 300 angstroms column (Interchim, France), 21 mm ID, 150 mm length. Injection 
was performed with a 10 ml/min flow rate. 

C8 column was equilibrated and washed with 0.1 % TFA in water (solution A). Proteins and 
peptides were eluted with abqjhasic gradient from 100% A until 100% B (0.1% TFA, 80% CH3CN 
in water) in 60 min. Flow rate was 20 ml/min. Thirty fractions of 40 ml were collected. 

Based on the measured optical density (OD) at 280 nm of each fraction, which reflects the 
protein concentration in that fraction, aliquots of similar protein content were created for each 
fraction. 

All aliquots were frozen and kept for further use except one per fraction which was dried with 
a Speed Vac (Savant, Fischer, Geneva) after addition of 500 ul 10% glycerol in water in each fraction, 
in order to. prevent excess drying. Dried fractions were kept at -20°C until use for next step. 
Step 5: Reverse Phase HPLC Fractionation 2 
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Dried angles from step 4 were resuspended in 1 ml of solution A (0.03% TFA in water) and 
injected on a Vydac LCMS C4 column, 5 micrometers, 300 angstroms (Vydac, USA), 4.6 mm ID, 150 
mm length. Flow rate was 0.8 ml/min. 

C4 column was equilibrated and washed with solution A and proteins and peptides were 
eluted with a biphasic gradient adapted to elution position of the sample in Reverse Phase HPLC 
Fractionation 1. Intact mass data were acquired using EJectrospray Ion Trap Massspectrometry. 
Sixteen different gradients were used with a CH3CN concentration range minus and phis 5% CH3CN 
of RP1 fraction corresponding solvent concentration. For proteins eluted in RP1 with a solvent 
concentration equal to or greater than 30 % CH3CN, the starting elution conditions for the RP2 
gradient was set, in CH3CN percentage, at the RP1 elution concentration minus 30%. Twenty-four 
ehited fractions were collected in a deep well plate, adopting optimized different collection 
configurations designed for optimal SpeedVac concentration and further robotic treatment 
Step 6: Mass detection 

About 13,000 fractions were collected following reverse phase HPLC fractionation 2 into 96- 
well deep well plates (DWP). A small proportion (23%) of the volume was diverted to online 
analysis using LC-ESI-MS (Bruker Esquire). Aliquots of undigested proteins were mixed with 
MALDI matrices, and spotted on MALDI plates together with mass calibration standards and 



A wo auieKm iv*aa^ mamces wereemptoyeu: sinapic acid (SA), also known as sinapinic acid, 
trans-3,5-<Jimethoxy^hydroxycinnamic acid, and alpha^^o^hydroxycinnamic acid (HCCA). 
MALDI plates were subjected to mass detection using Bruker Reflex m MALDI MS apparati. The 
96-well plates were stored at +4 C. 

96-well plates (DWP) were recovered and subjected to two sequential concentration steps. 
Volumes were concentrated from 0.8 ml to about 50 microl per well by drying with a SpeedVac, and 
then resolubilized to ca. 200 microl and reconcentrated to about 50 microl per well, and stored at +4 
C. Proteins were then digested by re-buffering, adding trypsin to the wells, sealing and incubating the 
plates at 37 C for 12 hours, followed by quenching (addition of formic acid to bring the pH down to 
2.0). The concentration of trypsin to be added to the wells was adjusted based on the OD at 280 am 
recorded for each particular fraction. This ensured an optimal use of trypsin and a complete digestion 
of the most concentrated fractions. Automated spotting devices (Bruker MALDI sample prep. 
Robots) were used to deposit a volume from each well, pre-mixed with a HCCA matrix onto a 
MALDI plate together with sensitivity and mass calibration standards. MALDI plates were analyzed 
using a Bruker Reflex m MALDI MS device. Contents from each well of the 96 well plates were 
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analyzed with LOESI-MS-MS Bruker Esquire ESI Ion-Trap MS devices. 

Step 7: Detection and Identification of Low Abundance Peptides in Human Plasma 

Separated fractions are further subjected to mass spectrometry (both matrix-assisted laser 
desorption/ionization (MALDI) and MS-MS) for separation and detection. 

Intact mass data, Peptide Mass Fingerprints and peptide sequence data were integrated for 
protein identification and characterization. Proteins were identified using Mascot-software (Matrix 
Science Ltd., London, UK), and results from peptide identification were checked by manual analysis 
of the spectra. 

Among the proteins identified by this process, Calgranulin A (S100 calcium-binding protein 
A8, of SwissProt accession number P05 109), was found to be expressed to a greater extent in the 
pooled sample from controls than in the pooled sample from CAD patients (e.g., peptides from die 
protein were observed in twice as many control fractions compared with disease fractions, and the 
cumulated scores obtained during mass spectra identification of this protein were 2.5-fold higher for 
the control sample). Calgranulin A has been characterized as a pro-inflammatory protein (Odink, et 
al., Nature 330 (6143), 80-82 (1987) and numerous later references). It is expressed by extravasatmg 
myeloid cells during mflammatory responses, where it binds to a glycosaminogfycan structure on 
epithelial cells (Robinson, et al., JBC 277:3658-^5 (2002)). Interestingly, PCT publication WO 

^HKn^ ^"^l.^^r 4V a „ r o ftf -tV— A * - C — .1 * - _ : CC? : . 11.. 

xr&nuMieiosi^±v±us^ver, r^i puoucaaon w^omay/tf xnsctosestneiise oi Calgranulin A as an 
inhibitor of vascular membrane growth for prevention of myocardial infarction and hypertension. It 
appears therefore that the protein separation and identification approach described herein is efficient 
at providing proteins which, when detected at higher levels in the control sample man in the disease 
sample, have a beneficiary effect for the treatment of the studied disease. 

Conversely, the methods of protein separation and identification described in this Example 
have allowed the identification of the Matrix Gla Protein (of SwissProt accession number P08493) as 
overexpressed in the pooled sample from CAD patients by comparison with the pooled sample from 
controls (e.g., peptides from the protein were observed in almost twice as many disease fractions 
compared with control fractions, and the cumulated scores obtained during mass spectra identification 
of this protein were 2-fold higher for the disease sample). MGP is a vitamin K-dependent protein 
which associates with the organic matrix of bone and cartilage. Mori, et al. demonstrated that MGP is 
capable of inhibiting vascular calcification (FEBS Letters 433: 19-22 (1998)). MGP levels are 
increased in atherosclerotic plaques as a likely feedback response to vessel calcification. PCT 
publications WO 01/02863 and WO 01/25427 describe MGP as a biomarker for atherosclerosis and 
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cardiovascular disorders. It appears therefore that the protein separation and identification approach 
described herein is efficient at providing proteins which have a recognized use in die diagnosis of the 
studied disease. 

The liyptic peptides of SEQ ID NOs: 3-4, 7-12, 14-19 and 22-25, listed in Figure 1 and Table 
1, were observed by tandem mass spectrometry at a higher level in the Coronary Artery Disease 
sample. 

Hie presence of a tryptic peptide indicates that a polypeptide comprising the amino acid 
sequence of SEQ ID NO:3 (QSGEDNQDLAISFAGNGLSALR) or SEQ ID NO:4 (ESLSGVCEISGR) 
was present at a higher level in the starting plasma sample from individuals with CAD. Such 
polypeptides include those represented by the sequences of SEQ ID NOs: 1 and 2 (CPP 12). 
Furthermore, the tryptic peptide of SEQ K>:3, which is the most detected one, belongs to the 
propeptide part of the protein (Ghosh, et aL, Nat Immunol. (2002) 3:583-90). The increase in disease 
of this propeptide therefore most probably reflects an incomplete processing of the propeptide into the 
Defensin 5 peptide. 

The presence of a tryptic peptide indicates that a polypeptide comprising the amino acid 
sequence of SEQ ID NO:7 (LQCYNCPNPTADCK), SEQ IDNO:8 (TAVNCSSDFDACLITK), SEQ 
ID NO:9 (AGLQVYNK), SEQ ID NO: 10 (FEHCNFNDVTTR), SEQ ID NO: 1 1 

iwarong pnu»ina saiiipicimmuiwY^imis wjfnwuu. oucii pujypepaacs include inose represented by 
the sequences of SEQ ID NOs:5 and 6 (CPP 15). 

The presence of a tryptic peptide indicates that a polypeptide comprising the amino acid 
sequence of SEQ ID NO: 14 (WAGQDAVTVLLGTR) SEQ ID NO: 15 (NDLSPTTVMSEGAR), SEQ 
ID NO: 1 6 (LQAVTDDHIR), SEQ ID NO: 1 7 (YV A VMPPHIGDQPLTG AYTVTLDGR), or SEQ ID 
NO: 19 (CLTTDEYDGHSTYPSHQYQ) was present at a higher level in the starting plasma sample 
from individuals with CAD. The tryptic peptide of SEQ ID NO: 1 8 (HDLGHFMLR) was detected in 
the control sample. Such polypeptides include those represented by the sequences of SEQ ID NO: 13 
(CPP 16). 

The presence of a tryptic peptide indicates that a polypeptide comprising the amino acid 
sequence of SEQ ID NO:22 (LYSVHRPVK), SEQ ID NO:23 (QCIHQLCFTSLR), SEQ ID NO:24 
(SNYFRIJ>PCENVDLQRPNGL), or SEQ ID NO:25 (LPPCENVDLQRPNGL) was present at a 
higher level in the starting plasma sample from individuals with CAD. Such polypeptides include 
those represented by the sequences of SEQ ID NOs:20 and 21 (CPP 18). 

It is interesting to note that the CPP 1 8 comprises at least two distinct species. The first one 
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elutes in RP1 fractions 6 and 7, and contains the following tryptic peptides: 
SNYFRLPPCENVDLQRPNGL and LPPCENVDLQRPNGL (see Table 1). The second one elutes in 
RP1 fractions TO and 1 1, and is therefore more hydrophobic, and contains only the following, 
different, tryptic peptides: QCIHQLCFTSLR and LYSVHRPVK. 

5 

The methods of protein separation and identification according to the invention are extremely 
sensitive. The MicroprotTM process is able to detect very low abundance proteins with a plasma 
concentration in the range of a few hundreds of pM. The accuracy was confirmed while carrying out 
the presently described methods. In particular, proteins with a well-characterized role in 
10 atherosclerosis and CAD were differentially detected in CAD and control samples. 

Example 2: Chemical Synthesis of CPPs 

m this example, a CPP of the invention is synthesized. Peptide fragment intermediates are 
first synthesized and then assembled into the desired polypeptide. 
15 A CPP can initially be prepared in, e.g. 5 fragments, selected to have a Cys residue at the N- 

to mi nus of the fragment to be coupled. Fragment 1 is initially coupled to fragment 2 to give a first 
product, then after preparative HPLC purification, the first product is coupled to fragment 3 to give a 

give a third product, finally, alter preparative HitLx, ^puraticafloii, tae tauu piourat is cuupivu a/ 
20 fragment 5 to give the desired polypeptide, which is purified and refolded 
Thio ester formation 

Fragments 2, 3, 4, and 5 are synthesized on a thioester generating resin, as described above. For this 
purpose the following resin is prepared: S-acetylthioglycolic acid pentafluorophenylester is coupled 
to a Leu-PAM resin under conditions essentially as described by Hackeng et al (1999). m the first 
25 case, the resulting resin is used as a starting resin for peptide chain elongation on a 0.2 mmol scale 

after removal of the acetyl protecting group with a 30 rmn treatment with 10% mercaptoethanol, 10% 
piperidine in DMF. The of the N-terminal Cys residues of fragments 2 through 5 are protected by 
coupling a Boc-thioproline (Boc-SPr, i.e. B oc-L-thioproline) to the terminus of the respective chains 
instead of a Cys having conventional N° or S p protection, e.g. Brik et al, J. Org. Chem, 65: 3829-3835 

30 (2000). 

Peptide synthesis 

Solid-phase synthesis is performed on a custom-modified 433A peptide synthesizer ft 
Applied Biosystems, using in situ neutralization/2-(lH-benzotriazol-l-yI>l, 1,1,3,3- 
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tetramethyluronium bexafhioro-pbosphate (HBTU) activation protocols for stepwise Boc chemistry 
chain elongation, as described by Schnolzer et al, Iht J. Peptide Protein Res., 40: 1 80-193 (1992). 
Each synthetic cycle consists of N*-Boc -removal by a 1 to 2 mm treatment with neat TFA, a 1-min 
DMF flow wash, a 10-min coupling time with 2.0 mmol of preactivated Boc-amino acid in the 
5 presence of excess DIEA and a second DMF flow wash. Na-Boc-amino acids (2 mmol) are 

pieactivated for 3min with 1 .8mmol HBTU (0.5M in DMF) in the presence of excess DIEA (6mmol). 
After coupling of Gin residues, a dichloromethane flow wash is used before and after deprotection 
using TFA, to prevent possible high temperature (TFA/DMF>-catalyzed pyrrolidone carboxylic acid 
formation. Side-chain protected amino acids are Boc-Arg(p-toluenesulfonyl)-OH, Boc-Asn(xanthyl)- 

1 0 OH, Boc-Asp(0-cyclohexyl>OH, Boc-Cys(4-memyIbenzyl>OH, Boc-Glu(0-cyclohexyl)-OH, Boc- 
His(dinitrophenylbenzyl>OH, Boc-Lys(2-Cl-Z)-OH, Boc-Ser(benzyl>OH, Boc-Thr(benzyl>OH, 
Bw>Trp{cyclohexylcarbonyl)-OH and Bec-Tyr(2-Br-Z)-OH (Orpagen Pharma, Heidelberg, 
Germany). Other amino acids are used without side chain protection. C-terminal Fragment 1 is 
synthesized on Boc-LeuO-CH 2 -Pam resin (0.71mmol/g of loaded resin), while for Fragments 2 

15 through 5 machine-assisted synthesis is started on the Bo c-Xaa-S-CH 2 -CO-Leu-Pain resin. This resin 
is obtained by the coupling of S-acetylthioglycolic acid pentafhiorophenylester to a Leu-PAM resin 
under standard conditions. The resulting resin is used as a starting resin for peptide chain elongation 

mercaptoethanol, 10%piperidinemDMF. 

20 After chain assembly is completed, the peptide fragments are deprotected and cleaved from 

the resin by treatment with anhydrous hydrogen fluoride for lhr at 0°C with 5% p-cresol as a 
scavenger. In all cases except Fragment 1, the imidazole side chain 2,4-dinitrophenyi (DNP) 
protecting groups remain on His residues because the DNP-removal procedure is incompatible with 
C-terminal thio ester groups. However DNP is gradually removed by thiols during the ligation 

25 reaction, yielding unprotected His. After cleavage, peptide fragments are precipitated with ice-cold 
diethylether, dissolved in aqueous acetonitrile and lyophilized. The peptide fragments are purified by 
RP-HPLC with a CI 8 column from Waters by using linear gradients of buffer B (acetonitile/0.1% 
trifluoroacetic acid) in buffer A (H 2 O/0.1% trifluoroacetic acid) and UV detection at 214nm. Samples 
are analyzed by electro spray mass spectrometry (ESMS) using an Esquire instrument (Brucker, 

30 Bremen , Germany), or like instrument 
Native chemical ligations 

As described more frilly below, the ligation of unprotected fragments is performed as follows: 
the dry peptides are dissolved in equimolar amounts in 6M guanidine hydrochloride (GuHCI), 0.2M 
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phosphate, pH 7.5 in order to get a final peptide concentration of 1-8 mM at a pH around 7, and 1% 
benzylmercaptan, 1% thiophenol is added. Usually, the reaction is carried out overnight and is 
monitored by HPLC and electrospray mass spectrometry. Hie ligation product is subsequently treated 
to remove protecting groups still present Opening of the N-terminal thiazolidme ring further required 
the addition of solid methoxamine to a 0.5M final concentration at pH3.5 and a further incubation for 
2h at 37 C C. A 10-fold excess of Tris(2-carboxyethyl)phosphine is added before preparative HPLC 
purification. Fractions containing the polypeptide chain are identified by ESMS, pooled and 
ryophilized. 

The ligation of fragments 4 and 5 is performed at pH7.0 in 6 M GuHCl. The concentration of 
each reactant is 8mM, and 1% ben2ylmercaptan and 1% thiophenol were added to create a reducing 
environment and to facilitate the ligation reaction. An almost quantitative ligation reaction is observed 
after overnight stirring at 37°C. At this point in the reaction, CH 3 -0-NH 2 .HC1 is added to the solution 
to get a 0 SM final concentration, and the pH adjusted to 3.5 in order to open the N-terminal 
thiazolidme ring. After 2h incubation at 37°C, ESMS is used to confirm the completion of the 
reaction. The reaction mixture is subsequently treated with a 10-fold excess of Tris(2- 
carboxyethylphosphine) over the peptide fragment and after ISmin, the ligation product is purified 
using the preparative HPLC (e.g., C4, 20-60% CH 3 CN, 0,5% per rnin), lyophilized, and stored at — 

The same procedure is repeated tor the remaining ligations with slight modifications. 
Polypeptide Folding 

The full length peptide is refolded by air oxidation by dissolving the reduced lyophilized protein 
(about 0.1 mg/mL) in 1M GuHCl, lOOmM Tris, lOmM methionine, pH 8.6 After gentle stirring 
overnight, fee protein solution is purified by RP-HPLC as described above. 

Example 3; Preparation of CPP antibody compositions 

Substantially pure CPP or a portion thereof is obtained. The concentration of protein in the 
final preparation is adjusted, for example, by concentration on an Amicon filter device, to the level of 
a few micrograms per ml. Monoclonal or polyclonal antibodies to the protein are then prepared as 
described in the sections titled "Monoclonal antibodies** and "Polyclonal antibodies." 

Briefly, to produce an anti-CPP monoclonal antibody, a mouse is repetitively inoculated with 
a few micrograms of the CPP or a portion thereof over a period of a few weeks. The mouse is then 
sacrificed, and the antibody producing cells of the spleen isolated. The spleen cells are fused by 
means of polyethylene glycol with mouse myeloma cells, and the excess unfused cells destroyed by 
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growth of the system on selective media comprising aminopterin (HAT media). The successfully 
fused cells are diluted and aliquots of the dilution placed in wells of a microliter plate where growth 
of the culture is continued. Antibody-producing clones are identified by detection of antibody in the 
supernatant fluid of the wells by immunoassay procedures, such as ELISA, as originally described by 
Engvall, E., Meth. EnzymoL 70: 419 (1980), the disclosure of which is incorporated herein by 
reference in its entirety. Selected positive clones can be expanded and their monoclonal antibody 
product harvested for use. Detailed procedures for monoclonal antibody production are described in 
Davis, L. et al. Basic Methods in Molecular Biology Elsevier, New York. Section 21-2, the 
disclosure of which is incorporated herein by reference in its entirety. 

For polyclonal antibody production by immunization, polyclonal antiserum containing 
antibodies to heterogeneous epitopes in the CPP or a portion thereof are prepared by immunizing a 
mouse with the CPP or a portion thereof, which can be unmodified or modified to enhance 
iramunogenicity. Any suitable nonhuman animal, preferably a non-human mammal, may be selected 
including rat, rabbit, goat, or horse. 

Antibody preparations prepared according to either the monoclonal or the polyclonal protocol 
are useful in quantitative immunoassays which determine concentrations of CPP in biological 
samples; or they are also used senu^niantitatively or qualitatively to identify the presence of antigen 

expiring me protein or reaming oie wvei&vi me protem m trie oocty. 
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CLAIMS 

1 . A method of screening for and /or diagnosis of a cardiovascular disorder in a subject, 
comprising the steps of. 

a) detecting and /or quantifying the level of a polypeptide in a biological sample from said 
subject, wherein the polypeptide is selected from: 

i) a polypeptide comprising the amino acid sequence selected from one of the 
groups consisting of SEQ ID NO: 1-2, 5-6, 1 3, and 20-2 1 ; 

ii) a variant, with at least 75% sequence identity, having one or more amino acid 
substitutions, deletions or insertions relative to the amino acid sequence shown 
in SEQ ID NOs: 1-2, 5-6, 13, and 20-21; and 

iii) a fragment of a polypeptide as defined in i) or ii) above which is a least ten 
amino acids long; and 

b) comparing said level to that of a control sample, 

wherein an increase in said level relative to that of the control is indicative of a cardiovascular 
disorder. 

2. A method of predicting a cardiovascular disorder in a subject, comprising the steps of: 

subject, wherein the polypeptide is selected from: 

i) a polypeptide comprising the amino acid sequence selected from one of the 
groups consisting of SEQ ID NO: 1-2, 5-6, 13, and 20-21; 

ii) a variant, with at least 75% sequence identity, having one or more amino acid 
substitutions, deletions or insertions relative to the amino acid sequence shown 
in SEQ ID NOs: 1-2, 5-6, 13, and 20-21; and 

iii) a fragment of a polypeptide as defined in i) or ii) above which is a least ten 
amino acids long; and 

b) comparing said level to that of a control sample, 

wherein an increase in said level relative to that of the control indicates a risk of developing a 
cardiovascular disorder. 

3. The method of claim 1 or 2, wherein the level of two or more of the polypeptides of claim 1 or 
2 are detected and /or quantified in a biological sample from said subject 

4. The method of any one of claims 1 to 3, wherein said cardiovascular disorder is Coronary 
Artery Disease (CAD). 
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The method of any one of claims I to 4, wherein said biological sample is plasma. 

Hie method of any one of claims 1 to 5, wherein said polypeptide is detected and /or quantified 
by mass spectrometry. 

Hie method of any one of claims 1 to 5, wherein said polypeptide is detected and /or quantified 
by Enzyme-Linked Immuno Sorbent Assay. 

An isolated polypeptide comprising the amino acid sequence selected from selected from one of 
the groups consisting of SEQ ID NOs: 1-4, 5-12, 13-19 and 20-25, wherein said polypeptide is 
fused to a heterologous polypeptide sequence. 

An anti-Cardiovascular disorder Plasma Polypeptide (CPP) antibody that selectively binds to a 
polypeptide comprising the amino acid sequence selected from selected from one of the groups 
consisting of SEQ ID NOs: 1-4, 5-12, 13-19 and 20-25. 

10. A method of binding an antibody to a Cardiovascular disorder Plasma Polypeptide (CPP) 
comprising the steps of: 

permit antibody binding; and 
ii) removing contaminants. 

11. The method of claim 10, wherein said antibody is attached to a label group. 

12. Hie method of claim 10, wherein said sample is human plasma. 

13. A method of identifying a Cardiovascular disorder Plasma Polypeptide (CPP) modulator 

comprising the steps of: 

i) contacting a test compound with a CPP comprising the amino acid sequence selected 
from one of the groups consisting of SEQ ID NOs: 1-4, 5-12, 13-19 and 20-25 under 
sample conditions permissive for at least one CPP biological activity; 

ii) deterrnining the level of said at least one CPP biological activity; 

iii) comparing said level to that of a control sample lacking said test compound; and 

iv) selecting a test compound which causes said level to change for further testing as a 
CPP modulator for the prophylactic and/or therapeutic treatment of cardiovascular 
disorders. 
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14. A method of identifying a modulator of a cardiovascular disorder comprising the steps of. 

(a) administering a candidate agent to a con-human test animal which is predisposed to be 
affected or which is affected by the cardiovascular disorder, 

(b) administering the candidate agent of (a) to a matched control non-human animal not 
predisposed to be affected or not being affected by the cardiovascular disorder; 

(c) detecting and /or quantifying the level of a polypeptide in a biological sample obtained 
from the non-human test animal of step (a) and from the control animal of step (b), 
wherein the polypeptide is selected from: 

i) a polypeptide comprising the amino acid sequence selected from one of the 
groups consisting of SEQ ID NO: 1-2, 5-6, 13, and 20-21; 

ii) a variant, with at least 75% sequence identity, having one or more amino acid 
substitutions, deletions or insertions relative to the amino acid sequence shown in 
SEQ ID NOs: 1-2, 5-6, 13, and 20-21; and 

iii) a Fragment of a polypeptide as defined in i) or ii) above which is a least ten 
amino acids long; and 

(d) comparing the levels of the polypeptide of step (c); wherein a displacement of the level 
of the polypeptide in the biological sample obtained from the non-human test animal 

animal indicates that the candidate agent is a modulator of the cardiovascular disorder. 

15. The method of claim 14, wherein the non-human test animal which is predisposed to be 
affected or which is affected by the cardiovascular disorder comprises an increased plasma 
level of a polypeptide selected from: 

i) a polypeptide comprising die amino acid sequence selected from one of the 
groups consisting of SEQ ID NO: 1-2, 5-6, 13, and 20-21; 

ii) a variant, with at least 75% sequence identity, having one or more amino acid 
substitutions, deletions or insertions relative to the amino acid sequence shown 
in SEQ ID NOs: 1-2, 5-6, 13, and 20-21; and 

iii) a fragment of a polypeptide as defined in i) or ii) above which is a least ten 
amino acids long. 
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16. A method for monitoring the efficacy of a treatment of a subject having ox at risk of developing 
a cardiovascular disorder with an agent, the method comprising: 

(a) obtaining a pre-administration biological sample from the subject prior to 
administration of the agent; 

(b) detecting and /or quantifying the level of a polypeptide in the biological sample from 
said subject, wherein the polypeptide is selected from: 

0 a polypeptide comprising the amino acid sequence selected from one of the groups 
consisting of SEQ ID NO: 1-2, 5-6, 13, and 20-21; 

ii) a variant, with at least 75% sequence identity, having one or more amino acid 
substitutions,' deletions or insertions relative to the amino acid sequence shown in 
SEQ ID NOs: 1-2, 5-6, 13, and 20-21; and 

iii) a fragment of a polypeptide as defined in i) or ii) above which is a least ten amino 
acids long; and 

(c) obtaining one or more post-administration biological samples from the subject; 

<d) detecting the level of the polypeptide in the post-administration sample or samples; 

(e) comparing the level of the polypeptide in the pre-administration sample with the level 
of the polypeptide in the post- administration sample; and 
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SEQ ID NO:l 

1 MRTIAILAAI LLVALOAOAE SLQERADEAT TQ KQSGEDNQ DLAISFAGNG LSALR TSGSO 
61 ARATCYCRTG RCAT RESLSG VCEISGRL YR LCCR 

SEQ ID NO:2 

1 ESLQERADEA TTQ KQSGEDN QDLAISFAGN GLSALR TSGS QARATCYCRT GRCAT RESLS 
61 GVCEISGRLY RLCCR 

SEQ ID NO:3 

QSGEDN QDLAISFAGN GLSALR 

SEQ ID NO:4 

ESLSG VCEISGR 
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Figure 2 

SEQ ID NO:5 

1 MGIOGGSVLF GLLLVLAVFC HSGHSLQCYN CPNPTADCKT AVNCSSDFDA CLITKAGLQV 
61 YNKCWKFEHC NFNDVTTRLR ENELTYYCCK KDLCNFNEQL E1 SGGTSLSEK TVLLLVTPFL 
121 AAAWSLHP 

SEQ ID NO:6 

1 LQCYNCPNPT ADCKTAVNCS SPFDACLITK AGLQVYNKCW KFEHCNFNDV TTRLRENELT 
61 YYCCKKDLCN FNEQL EN 

SEQ ID NO:7 

LQCYNCPNPT ADCK 

SEQ ID NO:8 

TAVNCSSDFD ACLITK 

SEQ ID NO:9 

AGLQVYNK 

SEQ ID NO;lt> 

FEHCNFNDVT TR 

SEQ ID NO: 11 

LRENELTYYC CK 

SEQ ID NO:12 

ENELTYYC CK 
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■ 

Figure 3 

SEQ ID NO:13 

1 MAVKKIAIFG ATGQTGLTTL AQAVQAGYEV TVLVRDSSRL PSEGPRPAHV WGDVLQAAD 
61 VDKTV&GQDA VIVLLGTRND I^SPTTVMSEG ARNIVAAMKA HGVDKWACT SAFLLWDPTK 
121 VPP KLQAVTO DHIRM HKVLR ESGLKYVAVM PPHIGDQFLT GAYTVTLPGR GPSRVISKHD 
181 I/5HBMLRGLT TDEYDGHSTY PSHQYQ 

SEQ ID NO:14 

TVAGQDAVIV LLGTR 

SEQ ID NO:15 

NDLSPTTVMS EGAR 

SEQ ID NO:16 

LQAVTDDHIR 

SEQ ID NO:17 

YVAVMPPHIG DQPLTGAYTV TLDGR 

SEQIDNG:!* 

HDLGHFMLR 

SEQ ID NO:19 

CLTTBEYDGH STYPSHQYQ 
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SEQ ID NO:20 

1 MSLLGPKVLL FLAAFIITSD W IPLGVNSQR GDDVTQATPE TFTEDPNLVN DPATDETVLA 

61 VLADIAPSTD DLASLSEKNT TAECWDEKFT CT RLYSVHRP VKQCIHQLCF TSLRR MYIVN 

121 KEICSRLVCK EHEAMKDELC RQMAGLPPRR LR RSNYFRLP PCEKVDLQRP NGL 

■ 

SEQ ID NO:21 

1 IPLGVNSQRG DDVTQATPET FTEDPNLVND PATDETVLAV LADIAPSTDD IASLSEKNTT 
61 AECWDEKFTC T RLYSVHRPV KQCIHQLCFT SLRR MYIVNK EICSRLVCKE KEAMKDELCR 
121 QMAGLPPRRL RRSNYFRLPP CENVDLQRPN GL 

SEQ ID NO:22 

LYSVHRPVK 

SEQIDNO:23 

QCIHQLCFTS LR 

SEQ ID NO:24 



SEQ ID NO:25 

LPPCENVDLQ RPNGL 
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4-33629 (5038-WOOD.ST25.txt 
SEQUENCE LISTING 



<110> Geneva Ltd. 

Bouguel e ret , Lydi e 



<120> SECRETED POLYPEPTIDE SPECIES ASSOCIATED WITH CARDIOVASCULAR DISORDERS 
<130> 4-33629A/GLT C5038-WO01) 

<150> US 60/475,121 
<151> 2003-05-30 

<150> US 60/475,082 
<151> 2003-05-30 

<150> US 60/475 f 083 
<151> 2003-05-30 

<150> US 60/484,102 
<151> 2003-06-30 

<160> 141 

<170> Patentln version 3.1 

<210> 1 

<211> 94 

<212> PRT 

<213> Homo sapiens 

<400> 1 

Met Arg Thr lie Ala lie Leu Ala Ala lie Leu Leu Val Ala Leu Gin 
15 10 15 

Aia vain Aia tuu ser Leu i*ut cnu Am ntct *.=>u *rru *ra utr inr uUt 

20 25 30 

Lys Gin ser Gly Glu Asp Asn Gin Asp Leu Ala lie Ser Phe Ala Gly 
35 40 45 

Asn Gly Leu ser Ala Leu Arg Thr ser Gly Ser Gin Ala Arg Ala Thr 
50 55 60 

Cys Tyr cys Arg Thr Gly Arg cys Ala Thr Arg Glu Ser Leu Ser Gly 
65 70 75 80 

val cys Glu lie ser Gly Arg Leu Tyr Arg Leu cys Cys Arg 

85 90 

<210> 2 

<211> 75 

<212> PRT 

<213> Homo sapiens 

<400> 2 

Glu ser Leu Gin Glu Arg Ala Asp Glu Ala Thr Thr Gin Lys Gin ser 
1 5 10 15 



Gly Glu Asp Asn Gin Asp Leu Ala lie ser Phe Ala Gly Asn Gly Leu 
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4-33629 C5038-WOOD.ST25.txt 
20 25 30 

Ser Ala Leu Arg Thr ser Gly ser Gin Ala Arg Ala Thr Cys Tyr Cys 
35 40 45 

Arg Thr Gly Arg cys Ala Thr Arg Glu ser Leu Ser Gly val Cys Glu 
50 55 60 

lie Ser Gly Arg Leu Tyr Arg Leu Cys cys Arg 
65 70 75 

<210> 3 

<211> 22 

<212> PRT 

<213> Homo sapiens 

<400> 3 

Gin ser Gly Glu Asp Asn Gin Asp Leu Ala lie Ser Phe Ala Gly Asn 
1 5 10 i«; 



Gly Leu ser Ala Leu Arg 

20 



<210> 4 

<211> 12 

<212> PRT 

<213> Homo sapiens 



Glu ser Leu ser Gly val cys Glu lie Ser Gly Arq 
1 5 in * 



10 

<210> 5 

<211> 128 

<212> prt 

<213> Homo sapiens 

<400> 5 

Met Gly He Gin Gly Gly ser val Leu Phe Gly Leu Leu Leu val Leu 
15 10 15 

Ala val Phe Cys His ser Gly His ser Leu Gin cys Tyr Asn cys Pro 

20 25 30 

Asn Pro Thr Ala Asp cys Lys Thr Ala Val Asn cys Ser ser Asp Phe 
35 40 45 

Asp so* Cys Leu Il€ Thr 55 s Ala Gly Leu Gln ^q 1 Tyr Asn L y s C V S 

Trp Lys Phe Glu His cys Asn Phe Asn Asp val Thr Thr Arg Leu Arg 
65 70 75 80 
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4-33629 (5038-W001). ST25.txt 
Glu Asn Glu Leu Thr Tyr Tyr Cys Cys Lys Lys Asp Leu Cys Asn Phe 

85 90 95 

Asn Glu Gin Leu Glu Asn Gly Gly Thr Ser Leu Ser Glu Lys Thr val 

100 105 110 

Leu Leu Leu val Thr pro Phe Leu Ala Ala Ala Trp Ser Leu His Pro 
115 120 125 

<21Q> 6 

<211> 77 

<212> PRT 

<213> Homo sapiens 

<400> 6 

Leu Gin Cys Tyr Asn Cys Pro Asn Pro Thr Ala Asp Cys Lys Thr Ala 
15 10 15 

val Asn cys ser Ser Asp Phe Asp Ala cys Leu lie Thr Lys Ala Gly 

20 25 30 

Leu Gin val Tyr Asn Lys Cys Trp Lys Phe Glu His cys Asn Phe Asn 
35 40 45 

Asp val Thr Thr Arg Leu Arg Glu Asn Glu Leu Thr Tyr Tyr cys cys 
50 55 60 



fai /u o 

<210> 7 

<211> 14 

<212> PRT 

<213> Homo sapiens 

<400> 7 

Leu Gin Cys Tyr Asn Cys Pro Asn Pro Thr Ala Asp Cys Lys 
15 10 

<210> 8 

<211> 16 

<212> PRT 

<213> ftomo sapiens 

<400> 8 

Thr Ala val Asn Cys Ser ser Asp Phe Asp Ala cys Leu lie Thr Lys 
1 5 10 15 

<210> 9 

<211> 8 

<212> PRT 

<213> Homo sapiens 

<400> 9 
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Ala nv i a\i n« va i -r, A 4-33629 (5038-WO0D.ST25.txt 
Aia Gly Leu Gin Val Tyr Asn Lys 

1 5 



<210> 10 

<211> 12 

<212> PRT 

<213> Homo sapiens 



<400> 10 

Phe Glu His cys Asn Phe Asn Asp val Thr Thr Arg 
1 5 10 

<210> 11 

<211> 12 

<212> PRT 

<213> Homo sapiens 

<400> 11 

Leu Arg Glu Asn Glu Leu Thr Tyr Tyr Cys Cys Lys 

1 5 .10 

<210> 12 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 12 

Glu Asn Glu Leu Thr Tyr Tyr cys Cys Lys 
1 5 in 

<210> 13 

<211> 206 

<212> PRT 

<213> Homo sapiens 

<400> 13 

Met Ala val Lys Lys He Ala He Phe Gly Ala Thr Gly Gin Thr Gly 

5 io 15 

Leu Thr Thr Leu Ala Gin Ala val Gin Ala Gly Tyr Glu val Thr val 

zo 25 30 

Leu Val Arg Asp ser Ser Arg Leu Pro ser Glu Gly Pro Arg Pro Ala 

40 45 

His val val val Gly Asp Val Leu Gin Ala Ala Asp Val Asp Lys Thr 

val Ala Gly Gin Asp Ala val He Val Leu Leu Gly Thr Arg Asn Asp 

/0 75 80 

Leu ser Pro Thr Thr Val Met Ser Glu Gly Ala Arg Asn He Val Ala 

90 " 95 
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A , M + , 4 , , 4-33629 C5038-WOOD.ST25.txt 

Ala Met Lys Ala His Gly val Asp Lys val val Ala cys Thr ser Ala 

100 105 HO 

Phe Leu Leu Trp Asp pro Thr Lys Val Pro Pro Arg Leu Gin Ala val 
115 120 125 

Thr Asp Asp His lie Arg Met His Lys val Leu Arg Glu ser Gly Leu 
130 135 140 

Lys Tyr val Ala Val Met Pro Pro His lie Gly Asp Gin Pro Leu Thr 
145 150 155 160 

Gly Ala Tyr Thr Val Thr Leu Asp Gly Arg Gly Pro Ser Arg Val He 

165 170 175 

ser Lys His Asp Leu Gly His Phe Met Leu Arg Cys Leu Thr Thr Asp 

180 185 ' * 190 

Glu Tyr Asp Gly His ser Thr Tyr pro ser His Gin Tyr Gin 
195 200 20! 



(5 



<210> 14 

<211> 15 

<212> PRT 

<213> Homo sapiens 

<400> 14 

1 5 10 15 

<210> 15 

<211> 14 

<212> PRT 

<213> Homo sapiens 

<400> 15 

Asn Asp Leu ser pro Thr Thr val Met Ser Glu Gly Ala Arg 
1 5 10 

<210> 16 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 16 

Leu Gin Ala val Thr Asp Asp His lie Arg 
1 5 10 

<210> 17 

<211> 25 

<212> PRT 

<213> Homo sapiens 

<400> 17 
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4-33629 C5038-WO01) . ST25 . txt 
Tyr Val Ala val Met Pro Pro His lie Gly Asp Gin Pro Leu Thr Gly 
15 10 15 

Ala Tyr Thr val Thr Leu Asp Gly Arg 

20 25 

<210> 18 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400> 18 

His Asp Leu Gly His Phe Met Leu Arg 
1 5 

<210> 19 

<211> 19 

<212> PRT 

<213> Homo sapiens 

<4O0> 19 

Cys Leu Thr Thr Asp Glu Tyr Asp Gly His Ser Thr Tyr Pro Ser His 
15 10 IS 

Gin Tyr Gin 



<210> 20 

<213> Homo sapiens 
<400> 20 

Met Ser Leu Leu Gly pro Lys val Leu Leu Phe Leu Ala Ala Phe lie 
15 10 15 

He Thr ser Asp Trp lie Pro Leu Gly val Asn Ser Gin Arg Gly Asp 

20 25 30 

Asp Val Thr Gin Ala Thr Pro Glu Thr Phe Thr Glu Asp Pro Asn Leu 
35 40 45 

val Asn Asp Pro Ala Thr Asp Glu Thr Val Leu Ala val Leu Ala Asp 
50 55 60 

lie Ala Pro Ser Thr Asp Asp Leu Ala Ser Leu ser Glu Lys Asn Thr 
65 70 75 80 

Thr Ala Glu cys Trp Asp Glu Lys Phe Thr Cys Thr Arg Leu Tyr Ser 

85 90 ^ 95 

val His Arg pro val Lys Gin Cys He His Gin Leu Cys Phe Thr Ser 

100 105 110 
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Leu Arg Arg Met Tyr lie val Asn Lys Glu He Cys Ser Arg Leu Val 
115 120 125 

Cys Lys Glu His Glu Ala Met Lys Asp Glu Leu cys Arg Gin Met Ala 
130 135 140 

Gly Leu Pro Pro Arg Arg Leu Arg Arg Ser Asn Tyr Phe Arg Leu Pro 
145 150 155 160 

Pro cys Glu Asn val Asp Leu Gin Arg pro Asn Gly Leu 

165 170 

<210> 21 

<211> 152 

<212> PRT 

<213> Homo sapiens 

<400> 21 

lie Pro Leu Gly val Asn ser Gin Arg Gly Asp Asp val Thr Gin Ala 
15 10 15 

Thr Pro Glu Thr Phe Thr Glu Asp Pro Asn Leu val Asn Asp Pro Ala 

20 25 30 

Thr Asp Glu Thr val Leu Ala val Leu Ala Asp lie Ala Pro ser Thr 
35 40 45 

Asp Asp Leu Ala ser Leu Ser *Hu t.ys Asn rnr rnr Aia uiu cy* icy 
50 55 60 

Asp Glu Lys Phe Thr Cys Thr Arg Leu Tyr Ser val His Arg Pro Val 
65 70 75 80 

Lys Gin Cys lie His Gin Leu Cys Phe Thr ser Leu Arg Arg Met Tyr 

85 90 95 

lie val Asn Lys Glu He cys ser Arg Leu Val Cys Lys Glu His Glu 

100 105 110 

Ala Met Lys Asp Glu Leu cys Arg Gin Wet Ala Gly Leu Pro Pro Arg 
115 120 125 

Arg Leu Arg Arg ser Asn Tyr Phe Arg Leu Pro Pro Cys Glu Asn val 
130 135 140 

Asp Leu Gin Arg Pro Asn Gly Leu 
145 150 

<210> 22 

<211> 9 

<212> PRT 

<213> Homo sapiens 
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<400> 22 

Leu Tyr ser val His Arg pro Val Lys 
1 5 

<210> 23 

<211> 12 

<212> PRT 

<213> Homo sapiens 

<400> 23 

Gin cys He His Gin Leu cys Phe Thr Ser Leu Arg 
1 5 10 

<210> 24 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 24 

ser Asn Tyr Phe Arg Leu Pro pro Cys Glu Asn val Asp Leu Gin Arg 
1 5 10 15 

Pro Asn Gly Leu 

20 

<210> 25 
<400> 25 

Leu Pro Pro cys Glu Asn val Asp Leu Gin Arg pro Asn Gly Leu 
15 10 15 

<210> 26 

<211> 14 

<212> PRT 

<213> Homo sapiens 

<400> 26 

cys Thr Ser Met Ala Ser Glu Asn Ser Glu cys Ser val Lys 
15 10 

<210> 27 

<211> 14 

<212> PRT 

<213> Homo sapiens 

<400> 27 

Ser Asn Cys Cys Gin His Ser ser Ala Leu Gly Leu Ala Arg 
15 10 

<210> 28 
<211> 19 
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<212> PRT 

<213> Homo sapiens 

<40O> 28 

Thr lie Val Gly ser He Thr Asn Thr Asn Phe Gly lie Cys His Asp 
1 5 in 



Ala Gly Arg 



<210> 29 

<211> 12 

<212> PRT 

<213> Homo sapiens 

<400> 29 



cys i_eu Asp Pro val Asp Thr Pro Asn Pro Thr Arg 
1 5 10 



<210> 30 

<211> 15 

<212> PRT 

<213> Homo sapiens 

<400> 30 



Tyr Lys Lys Pro Glu Cys Gin ser Asp Trp Gin Cys Pro Gly Lys 
1 5 in i c 



5 10 15 

<^XJ> 14 

<212> PRT 

<213> Homo sapiens 

<400> 31 

Asp Pro Pro Gin Tyr Pro Val val pro val His Leu Asp Arg 
1 5 10 

<210> 32 

<211> 15 

<212> PRT 

<213> Homo sapiens 

<400> 32 

Arg Asp Pro Pro Gin Tyr Pro val val Pro Val His Leu Asp Arg 
1 5 10 15 

<210> 33 

<2U> 17 

<212> prt 

<213> Homo sapiens 

<400> 33 

ryr Ala Gin Thr Pro Ala Asn Met Phe Tyr lie Val Ala Cys Asp Asn 
1 5 10 15 
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<210>~ 34 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400> 34 

Asp Ala Leu ser Ala ser val val i_ys 
15 



<210> 35 

<211> 13 

<212> PRT 

<213> Homo sapiens 

<400> 35 

Asp Ser Gly Glu Asp Pro Ala Thr cys Ala Phe Gin Arg 
15 10 



<210> 36 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 36 

Asp Tyr Tyr val ser Thr Ala Val Cys Arg 



<210> 37 

<211> 12 

<212> PRT 

<213> Homo sapiens 

<400> 37 

Phe Pro val Tyr Asp Tyr Asp Pro ser ser Leu Arg 
1 5 10 

<210> 38 

<211> 12 

<212> PRT 

<213> tfomo sapiens 

<400> 38 

val Asn ser Gin Ser Leu Ser Pro Tyr Leu Phe Arg 
1 5 10 

<210> 39 

<211> 12 

<212> PRT 

<213> Homo sapiens 

<400> 39 

Val Ser Ala Gin Gin val Gin Gly val His Ala Arg 
1 5 io 
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<210> 40 

<211> 12 

<212> PRT 

<213> Homo sapiens 

<400> 40 

Gly val ser Leu Arg Pro He Gly Ala Ser Cys Arg 
15 10 

<210> 41 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 41 

Gly val Ser Leu Arg pro lie Gly Ala ser cys Arg Asp Asp Ser Glu 
15 10 15 

Cys lie Thr Arg 

20 

<210> 42 

<211> 31 

<212> PRT 

<213> fiomo sapiens 

<400> 42 

1 5 ±<j ±> 

Glu Val val val ser Leu Ala Trp Asp Glu Ser Leu Ala Pro Lys 

20 25 30 

<210> 43 

<2U> 9 

<212> PRT 

<213> Homo sapiens 

<400> 43 

lie Pro Ala Cys lie Ala Gly Glu Arg 

<210> 44 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 44 

lie Pro Ala cys lie Ala Gly Glu Arg Arg 
1 5 10 



<210> 45 
<211> 6 
<212> PRT 
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<213> Homo sapiens 
<400> 45 

Leu Trp Ala Phe cys cys 
1 5 
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<210> 46 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<40O> 46 

Arg Tyr Gly Thr cys lie Tyr Gin Gly Arg 



<210> 47 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<:400> 47 

Tyr Gly Thr cys lie Tyr Gin Gly Arg 
1 5 



<210> 48 

<211> 15 

<212> PRT 

<213> Homo sapiens 



Tyr Gly Thr cys lie Tyr Gin Gly Arg Leu Trp Ala Phe cys Cys 
15 10 15 



<21€> 49 

<211> 14 

<212> PRT 

<213> Homo sapiens 

<400> 49 

Ala Gly Pro Ala Gin Thr Leu He Arg Pro Gin Asp Met Lys 
15 10 



<21Q> 50 

<211> 13 

<212> PRT 

<213> Homo sapiens 

<400> 50 

Met Ser Ser ser Tyr Pro Thr Gly Leu Ala Asp Val Lys 
1 5 10 



<210> 51 

<2U> 27 

<212> PRT 

<213> Homo sapiens 
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Met Ser ser ser Tyr Pro Thr Gly Leu Ala Asp Val Lys Ala Gly Pro 
15 10 15 

Ala Gin Thr Leu lie Arg Pro Gin Asp Met Lys 

20 '25 

<210> 52 

<211> 7 

<212> PRT 

<213> Homo sapiens 

<400> 52 

Ala Phe Gin Tyr His Ser Lys 

<210> 53 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 53 

Cys Glu Glu Asp Lys Glu Phe Thr cys Arg 
1 5 10 

<210> 54 
<211> 19 
<212> PRT 

<400> 54 

Glu Pro Leu Asp Asp Tyr val Asn Thr Gin Gly Pro ser Leu Phe Ser 

5 10 15 

Val Thr Lys 



<210> 55 

<211> 13 

<212> prt 

<213> Homo sapiens 

<400> 55 

Glu Asp Pro Thr Val Ser Ala Leu Leu Thr ser Glu Lys 
1 5 10 

<210> 56 

<211> 10 

<212> prt 

<213> Homo sapiens 



<400> 56 

val Pro ser Leu val Gly ser phe lie Arg 
15 in 



10 
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<210> 57 
<211> 9 
<212> PRT 

<213> Homo sapiens 
<400> 57 

Cys Leu His Pro cys val lie Ser Arg 
15 

<210> 58 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400> 58 

Glu Ala Thr Phe Cys Asp Phe Pro Lys 
1 5 

<21Q> 59 

<211> 11 

<212> PRT 

<213> Homo sapiens 

<400> 59 

Glu II* Met Glu Asn Tyr Asn He Ala Leu Arg 
1 5 10 



<212> PRT 

<213> Homo sapiens 
<400> 60 

Gly Trp ser Thr Pro Pro Lys 
1 5 



<210> 61 

<211> 11 

<212> PRT 

<213> Homo sapiens 

<4G0> 61 

lie Asn His Gly He Leu Tyr Asp Glu Glu Lys 
1 5 10 



<210> 62 

<211> 13 

<212> PRT 

<213> Homo sapiens 

<400> 62 

lie Thr cys Thr Glu Glu Gly Trp ser Pro Thr Pro Lys 
1 5 10 



<210> 63 
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<211> 8 

<212> PRT 

<213> Homo sapiens 

<400> 63 

Leu Glu Tyr Pro Thr Cys Ala Lys 
1 5 

<210> 64 

<211> 13 

<212> PRT 

<213> Homo sapiens 

<400> 64 

Leu Gin Asn Asn Glu Asn Asn lie ser cys Val Glu Ara 
1 5 10 

<210> 65 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400> 65 

Asn Gly Gin Trp ser Glu Pro Pro Lys 
15 

<210> 66 
<211> 18 



<400> 66 

Asn Gly Gin Trp ser Glu Pro Pro Lys cys Leu His Pro Cys Val ll« 
1 5 10 15 



Ser Arg 



<210> 67 

<211> 5 

<212> prt 

<213> Homo sapiens 

<400> 67 

Ser Phe Trp Thr Arg 
1 5 



<210> 68 

<211> 18 

<212> PRT 

<213> Homo sapiens 



<400> 68 

ser Phe Trp Thr Arg lie Thr cys Thr Glu Glu Gly Trp ser Pro Thr 
1 5 10 15 
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Pro Lys 

<210> 69 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 69 

Ser Thr Asp Thr ser cys Val Asn Pro Pro Thr val Gin Asn Ala His 
1 5 in iq 



lie Leu Ser Arg 

20 



<210> 70 

<211> 10 

<212> PRT 

<213> Homo sapiens 



<400> 70 

Thr Gly Glu ser Ala Glu Phe Val cys Lys 
1 5 10 



<210> 71 
<211> 11 
<212> PRT 



<4UU> /X 

Thr Gly Glu ser Ala Glu Phe Val cys Lys Arq 
I 5 10 

<210> 72 

<211> 15 

<212> PRT 

<213> Homo sapiens 

<400> 72 

Thr Thr cys Trp Asp Gly Lys Leu Glu Tyr Pro Thr Cys Ala Lys 
15 10 15 

<210> 73 

<211> 26 

<212> PRT 

<213> Homo sapiens 

<400> 73 

Tyr Lys Pro Phe Ser Gin val pro Thr Gly Glu Val Phe Tyr Tyr Ser 
1 5 10 15 

cys Glu Tyr Asn Phe val ser Pro ser Lys 

20 25 
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<210> 74 

<211> 13 

<212> PRT 

<213> Homo sapiens 

<400> 74 

Ala Phe Thr Glu Cys cys val val Ala ser Gin Leu Arg 
15 10 

<210> 75 

<211> 17 

<212> PRT 

<213> Homo sapiens 

<400> 75 

Cys Cys Tyr Asp Gly Ala cys val Asn Asn Asp Glu Thr Cys Glu Gin 
15 10 15 

Arg 



<210> 76 

<211> 17 

<212> PRT 

<213> Horao sapiens 

<400> 76 

Asp val Val Gin xle Thr cys Leu Asp Gly Phe Glu val Val Glu Gly 



Arg 



<210> 77 

<211> 12 

<212> PRT 

<213> Homo sapiens 

<400> 77 

Glu Asp Thr Pro Asn Ser Val Trp Glu Pro Ala Lys 
15 10 

<210> 78 

<211> 15 

<212> PRT 

<213> Horao sapiens 

<400> 78 

Gly Asp Ser Gly Gly Ala Phe Ala Val Gin Asp Pro Asn Asp Lys 
15 10 15 

<210> 79 

<211> 19 

<212> PRT 

<213> Homo sapiens 
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<400> 79 

Gin Phe Gly Pro Tyr Cys Gly His Gly Phe Pro Gly Pro Leu Asn lie 
15 10 15 

Glu Thr Lys 



<210> 80 

<211> 16 

<212> PRT 

<213> Homo sapiens' 

<400> 80 

Ser ASfi Ala Leu Asp lie lie Phe Gin Thr Asp Leu Thr Gly Gin Lys 
15 10 15 

<210> 81 

<211> 19 

<212> PRT 

<213> Homo sapiens 

<400> 81 

Ser Ser Asn Asn Pro His ser Pro lie val Glu Glu Phe Gin val Pro 
15 10 15 

Tyr Asn Lys 



<210> *z 

<211> 11 

<212> PRT 

<213> Homo sapiens 

<400> 82 

Thr Asn Phe Asp Asn Asp lie Ala Leu val Arg 
1 5 10 



<210> 83 

<211> 14 

<212> PRT 

<213> Homo sapiens 

<400> 83 

val Glu Asp Pro Glu ser Thr Leu Phe Gly ser val lie Arg 
1 5 10 



<210> 84 

<211> 14 

<212> PRT 

<213> Homo sapiens 

<400> 84 

Leu Ala Glu Leu Pro Ala Asp Ala Leu Gly Pro Leu Gin Arg 
1 5 H 10 
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<210> 85 

<211> 16 

<212> PRT 

<213> Homo sapiens 

<40Q> 85 

Leu Ala Tyr Leu Gin Pro Ala Leu Phe Ser Gly Leu Ala Glu Leu Arg 
1 5 io is 



<210> 86 

<211> 14 

<212> PRT 

<213> Homo sapiens 



<400> 86 

Leu Glu Ala Leu Pro Asn Ser Leu Leu Ala Pro Leu Gly Arg 
1 5 10 



10 

<210> 87 

<211> 22 

<212> PRT 

<213> Homo sapiens 

<400> 87 

Ser Phe Glu Gly Leu Gly Gin Leu Glu Val Leu Tfir Leu Asp «is Asn 
15 10 15 

n i on r.ln f:1n i ^»«* 



<210> 88 

<211> 16 

<212> PRT 

<213> Homo sapiens 

<400> 88 

val Ala Gly Leu Leu Glu Asp Thr Phe Pro Gly Leu Leu Gly Leu Arg 

1 5 m « * 



<210> 89 
13 



PRT 

<213> Homo sapiens 
<400> 89 

cys Phe Leu Gly cys Glu Leu Pro Pro Glu Gly ser Arg 
1 5 10 

<210> 90 

<211> 13 

<212> PRT 

<213> Homo sapiens 

<400> 90 

Glu Phe Leu Glu Asp Thr Cys val Gin Tyr val Gin Lys 
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15 10 

<210> 91 

<211> 17 

<212> PRT 

<213> Homo sapiens 

<400> 91 

Thr Gin Ser Gly Leu Gin Ser Tyr Leu Leu Gin Phe His Gly Leu val 
15 10 15 

Arg 



<210> 92 

<211> 11 

<212> PRT 

<213> Homo sapiens 

<400> 92 

Ala Leu Asn Ser lie lie Asp Val Tyr His Lys 
15 10 



<210> 93 

<211> 7 

<212> PRT 

<213> Homo sapiens 



Gly Ala Asp val Trp phe Lys 
1 5 



<210> 94 

<211> 8 

<212> PRT 

<213> Homo sapiens 

<400> 94 

Gly Asn Phe His Ala val Tyr Arg 
1 5 



<210> 95 

<211> 11 

<212> PRT 

<213> Homo sapiens 

<400> 95 

Leu Leu Glu Thr Glu Cys Pro Gin Tyr lie Arg 
1 5 10 



<210> 96 

<211> 7 

<212> PRT 

<213> Homo sapiens 

<400> 96 
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Met Leu Thr Glu Leu Glu Lys 
1 5 



<210> 97 

<211> 8 

<212> PRT 

<213> Homo sapiens 

<400> 97 

Ala lie Asp Gly lie Asn Gin Arg 
1 5 



<210> 98 

<211> II 

<212> PRT 

<213> Homo sapiens 

<400> 98 

Cys Met Gly Thr val Thr Leu Asn Gin Ala Arg 
i 5 10 



<210> 99 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400> 99 



<210> 100 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 100 

Phe Ala Leu Leu Gly Asp Phe Phe Arg Lys 
15 10 



<210> 101 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400> 101 

Gly ser Phe Asp lie Ser cys Asp Lys 
1 5 



<210> 102 

<211> 12 

<212> PRT 

<213> Homo sapiens 

<400> 102 

Gly ser Phe Asp lie ser cys Asp Lys Asp Asn Lys 
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10 



<210> 
<211> 
<212> 



103 
6 

PRT 

Homo sapiens 



<400> 103 

lie Lys Asp Phe Leu Arg 
1 5 



<210> 104 

<211> 11 

<212> PRT 

<213> Homo sapiens 

<400> 104 

Gin Val Leu Ser Tyr Lys Glu Ala val Leu Arg 
1 5 10 



<210> 105 

<211> 12 

<212> PRT 

<213> Homo sapiens 

<400> 105 

Thr Thr Gin Gin Ser Pro Glu Asp Cys Asp Phe Lys 
1 5 10 



<210> 106 

<211> 13 

<212> PRT 

<213> Homo sapiens 

<400> 106 

Thr Thr Gin Gin Ser Pro Glu Asp Cys Asp Phe Lys Lys 
1 5 10 



<210> 107 

<211> 6 

<212> PRT 

<213> Homo sapiens 

<400> 107 

cys Asp Tyr Trp lie Arg 



<210> 108 

<211> 7 

<212> PRT 

<213> Homo sapiens 

<400> 108 

Glu Leu Thr Ser Glu Leu Lys 
1 5 
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<210> 109 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400> 109 

Met Tyr Ala Thr lie Tyr Glu Leu Lys 
1 5 

<210> 110 

<211> 21 

<212> PRT 

<213> Homo sapiens 

<400> 110 

Ser Leu Gly Leu Pro Glu Asn His lie val Phe Pro Val Pro lie Asp 
15 10 15 

Gin tys He Asp Gly 

20 

<210> 111 

<211> 11 

<212> PRT 

<213> Homo sapiens 

<400> 111 



<210> 112 

<211> 17 

<212> PRT 

<213> Homo sapiens 

<400> 112 

Thr Phe val Pro Gly Cys Gin Pro Gly Glu Phe Thr Leu Gly Asn lie 
15 10 15 



Lys 



<210> 113 

<211> 15 

<212> PRT 

<213> Homo sapiens 

<400> 113 

Val Pro Leu Gin Gin Asn Phe Gin Asp Asn Gin Phe Gin Gly Lys 
15 10 15 

<210> 114 

<211> 15 

<212> prt 

<213> Homo sapiens 
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<400> 114 

Val val Ser Thr Asn Tyr Asn Gin His Ala Met Val Phe Phe Lys 
15 10 15 

<210> 115 

<211> 13 

<212> PRT 

<213> Homo sapiens 

<400> 115 

Trp Tyr val Val Gly Leu Ala Gly Asn Ala lie Leu Arg 
15 10 

<210> 116 

<211> 11 

<212> PRT 

<213> Homo sapiens 

<400> 116 

Ala Trp Met Glu Thr Glu Asp Thr Leu Gly Arg 
15 10 

<210> 117 

<211> 14 

<212> PRT 

<213> Homo sapiens 



Asp Asp <5ln Leu val Val teu Phe Pro Val Gin Lys Pro Lys 
15 10 



<210> 118 

<211> 8 

<212> PRT 

<213> Homo sapiens 

<400> 118 

Gly Pro lie Leu Pro Gly Thr Lys 
1 5 



<210> 119 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 119 

His Trp pro Ser Glu Gin Asp Pro Glu Lys 
1 5 10 



<210> 120 

<211> 15 

<212> PRT 

<213> Homo sapiens 

<400> 120 
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His Trp Pro Ser Glu Gin Asp Pro Glu Lys Ala Trp Gly Ala Arg 
15 10 15 

<210> 121 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400> 121 

Leu Leu Thr Thr Glu Glu Lys Pro Arg 
1 5 

<210> 122 

<211> 13 

<212> PRT 

<213> Homo sapiens 

<400> 122 

Leu Leu Thr Thr Glu Glu Lys Pro Arg Gly Gin Gly Arg 
15 10 

<210> 123 

<211> 24 

<212> PRT 

<213> Homo sapiens 

<400> 123 

i — — — \t-^\ &*~-v nv.^ ft <r u-ir f "1 1 1 et« » on rz~\ \ t Dr-o <tl M (Z~\ \ i Acn 



<;ln Asp His He Tyr His Pro Gin 

20 

<210> 124 

<211> 26 

<212> PRT 

<213> Homo sapiens 

<400> 124 

Val Leu ser Pro Glu Pro Asp His Asp Ser Leu Tyr His Pro Pro Pro 
1 5 10 15 

Glu <Slu Asp <3ln Gly Glu Glu Arg Pro Arg 

20 25 

<210> 125 

<211> 21 

<212> PRT 

<213> Homo sapiens 

<400> 125 

Val val Glu Pro Pro Glu Lys Asp Asp Gin Leu val val Leu Phe Pro 
15 10 15 
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Val Gin Lys Pro Lys 

20 
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<210> 126 

<211> 14 

<212> PRT 

<213> Homo sapiens 

<400> 126 

Glu val Met Pro Ser lie Glh Ser Leu Asp Ala Leu val Lys 
15 10 



<210> 127 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 127 

Gly Leu Met Tyr Ser val Asn Pro Asn Lys 
1 5 10 



<210> 128 

<211> 11 

<212> PRT 

<213> Homo sapiens 

<400> 128 

Asn Ala Asn Thr Phe lie Ser Pro Gin Gin Arg 



<210> 129 

<211> 19 

<212> PRT 

<213> Homo sapiens 

<400> 129 

Tyr Glu Ser His Glu Ser Met Glu Ser Tyr Glu Leu Asn Pro Phe lie 
15 10 15 



Asn Arg Arg 



<210> 130 
13 
PRT 

<213> Homo sapiens 
<400> 130 

Ala Pro Gin Thr Gly lie val Asp Glu Cys cys Phe Arg 
15 10 



<210> 131 

<211> 15 

<212> PRT 

<213> Homo sapiens 
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<400> 131 

Gly phe Tyr Phe Asn Lys Pro Thr Gly Tyr Gly Ser ser ser Arg 
15 10 15 

<210> 132 

<211> 21 

<212> PRT 

<213> Homo sapiens 

<400> 132 

Gly pro Glu Thr Leu cys Gly Ala Glu Leu val Asp Ala Leu Gin Phe 
15 10 15 

val cys Gly Asp Arg 

20 



<210> 


133 


<213> 


12 


<212> 


PRT 


<213> 


Homo 


<400> 


133 


Leu Glu Met 


1 




<210> 


134 


<211> 


14 




"DT 


<400> 


134 


Arg Ala Pro 


1 




<210> 


135 


<211> 


13 


<212> 


PRT 


<213> 


Homo 


<400> 


135 


Arg Leu Glu 
1 


<210> 


136 


<211> 


22 


<212> 


PRT 


<213> 


Homo 


<400> 


136 


Ala Gin Glu 



10 



10 



10 



Ala Gin Glu Pro val Lys Gly Pro Val ser Thr Lys Pro Gly ser cys 
15 10 15 



Pro lie lie Leu lie Arg 

20 
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<210> 137 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<4O0> 137 

Cys Ala Met Leu Asn Pro Pro Asn Arq 
15 

<210> 138 

<2U> 11 

<212> PRT 

<213> Homo sapiens 

<400> 138 

Cys Leu Lys Asp Thr Asp Cys Pro Gly lie Lys 
1 5 10 

<210> 139 

<211> 16 

<212> PRT 

<213> Horao sapiens 

<400> 139 

Gly Pro Val ser Thr Lys Pro Gly ser cys Pro He He Leu He Arq 
1 5 io 15 y 

<212> PRT 

<213> Homo sapiens 
<400> 140 

val Pro Phe Asn Gly Gin Asp Pro val Lys 
1 5 io 

<210> 141 

<211> 16 

<212> PRT 

<213> Homo sapiens 

<400> 141 

Val Pro Phe Asn Gly Gin Asp Pro Val Lys Gly Gin Val Ser Val Lys 
1 5 10 15 
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